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DESCRIPTION 

Identification and molecular characterisation of proteins 
expressed in the tick salivary glands. 

FIELD OF INVENTION. 

The invention relates to the molecular characterisation of 
DNA sequences which encode proteins expressed in the salivary 
5 glands of ticks, more particularly the Ixodes ricinus arthropod tick. 
These proteins are involved in the complex mechanism of interaction,, 
between this arthropod and its mammalian host. The invention relates 
to newly identified polynucleotides, polypeptides encoded by them 
and the use of such polynucleotides and polypeptides, and to their 
10 production. 

BACKGROUND OF THE INVENTION. 

Ticks are ectoparasites which infest a large number of 
15 animals such as mammals, birds, reptiles and amphibians. They are 
present in almost every area of the world. The tick feeding process 
itself is often harmful to the host. In addition, many of these ticks are 
vectors of viruses, bacteria and protozoa that cause host morbidity 
and, in some cases, mortality, particularly in humans and livestock 
20 animals. There are three families of ticks : the Ixodidae or hard ticks, 
the Argasidae or soft ticks, and the Nuttalliellidae . The life cycle of 
all ticks involves four stages (egg - larva - nymph - adult). In the 
majority of species, as Ixodes ricinus, the ticks drop off the host 
animal after each blood meal. The larvae hatch from the eggs and 
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climb the vegetation where they come into easy reach of passing 
animals. Once on the host, they attach themselves and feed on blood. 
Nymphs and adults feed on other hosts and employ the same methods 
of host seeking. Mating of adults often takes place on the host while 
5 attached and feeding. Egg laying occurs after detachment. 

The tick salivary gland play an important role in the 
accomplishment of the blood meal and in the transmission of 
pathogens. During the blood meal, Ixodidae ticks concentrate the 
blood by using special mechanisms which eliminate the excess of 
10 water and ions through salivary glands. A striking modification of the 
morphology and the physiology of the salivary glands occurs during 
this blood meal. The cytoplasm and the nucleus of several salivary 
gland cells enhance in volume leading to an important increase in the 
size and the weight of the salivary glands. The messenger RNA 
15 (mRNA) synthesis is also induced, resulting in the expression of new 
proteins. At the end of the meal, the degeneration of the salivary 
glands is probably caused by the 20-hydroxyecdisone, also called 
"salivary degenerating factor'*. 

The salivary gland is rich in bio-active factors : cement, 
20 enzymes, enzyme inhibitors, histamine agonist and antagonist, 

anticoagulant factors, modulating factors of the host immune response, 
prostaglandin. Some of these interactive factors are already present in 
the salivary glands of unfed ticks; but others, mainly proteins, are 
induced during the feeding phase of the blood meal. These induced- 
25 proteinic factors seem to play an important role in the modulation of 
the host immune response. One of these factors, a 65 kDa protein, 
was isolated by Brossard and co-workers (Ganapamo et aL, 1997), 
This protein induces, in vitro, a specific CD4 + T cell proliferation of 
lymph node cells from mice infested with /. ricinus ticks (Ganapamo 
30 et aL, 1997). These cells produce high levels of interleukin-4 (IL-4) 
and low levels of interferon-y (IFN-y) when they are stimulated with 
concanavalin-A (Con A). This suggest a T H 2 polarisation of the 
cytokine pattern (Ganapamo et aL, 1995). The production of IL-5 and 
IL-10 confirms this phenomenon (Ganapamo et aL, 1996). This 
35 polarised response could constitute favourable conditions for the 
transmission of some pathogens. 
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The inhibition of the alternative pathway of complement 
activation, the decrease of the synthesis of antibodies induced by 
thymo-dependent antigens in infested animals, and the decrease of the 
proliferative activity of T-lymphocytes stimulated with mitogens, 
5 contribute to the setting up of these processes (Wikel et aL, 1996; 
Brossard and Wikel, 1997), Furthermore, prostaglandins (PGE 2 ) and 
salivary proteins are involved in the suppression of the immune 
response. In addition, it is known that some proteinic factors 
expressed by the salivary glands stimulate the growth of Borrelia 
10 burgdorferi, the causal agent of Lyme disease, which is the main 
human pathogen transmitted by the Ixodidae ticks (De Silva et al. f 
1995) 

SUMMARY OF THE INVENTION. 

15 

The Ixodes ricinus tick is the known vector of Borrelia 
burgdorferi sensu lato, the causal agent of Lyme disease. Tick 
salivary glands play an important role in the blood meal and in 
pathogen transmission. In the salivary glands, several genes are 
20 induced during the feeding process leading to the expression of new ' 
proteins which can play an important role in the modulation of the 
host immune and haemostatic responses. 

It is accordingly important to identify and characterize 
25 those genes which are induced during the feeding process as well as 
those proteins which are expressed under those circumstances and to 
take benefit of the interesting properties of the substances thus 
identified. 

30 Accordingly, in a first aspect, the present invention 

relates to a method for characterising genes which are induced in the 
salivary glands of a tick, preferably an Ixodes ricinus tick, during the 
tick feeding phase which comprises : 

35 a) selectively cloning mRNAs induced during the tick 

feeding phase to obtain a corresponding cDNA library ; 
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b) cloning full-length cDNAs corresponding to some 
incomplete cDNA sequences identified in the library obtained in step 
a). 

5 More particularly, the genes induced are those induced 

during the slow-feeding phase of the blood meal. 

In a preferred aspect, the method comprises : 

10 a) synthesising uninduced cDNAs starting from mRNAs 

expressed in the salivary gland of unfed ticks ; 

b) synthesising induced cDNAs starting from mRNAs 
expressed in the salivary gland of fed ticks ; 

15 

c) subtracting said uninduced cDNAs from said induced 

cDNAs ; 

d) isolating and cloning specifically induced cDNAs, thus 
20 obtaining a sub"$tractive library ; 

e) obtaining corresponding full-length induced cDNA ; 

f) sequencing and comparing said full-length induced 
25 DNA molecules with known-polypeptide and polynucleotide 

sequences. 

In a further preferred aspect, a full-length cDNA library 
is set up and screened by means of incomplete cDNAs isolated from 
30 the sub^tractive library. 

More particularly, the method comprises : 

a) randomly sequencing a number of clones of said 
35 sub^tractive library ; 
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b) comparing their DNA and amino acid translated 
sequences with DNA and protein databases ; 

c) identifying distinct family sequences ; 

d) characterising their corresponding full-length mRNA 

sequence. 

Other features of said method are described hereafter. 



The sequences identified by the above described method 
can be divided into three classes : i) the first one refers to putative 
anticoagulant and anticomplementary sequences; ii) the second one 
refers to putative immunomodulatory sequences; iii) the last class 
15 comprises sequences presenting low or no homologies to known 
polynucleotide and polypeptide sequences 

According to other aspects, the present invention relates 
to polynucleotides isolated from the tick salivary glands, polypeptides 
20 encoded by said polynucleotides as well as to the use of said 
polynucleotides and said polypeptides. 

Said polynucleotides, said polypeptides and said uses are 
defined and described hereafter, 

25 

BRIEF DESCRIPTION OF THE DRAWINGS. 
Figure 1 . 

30 RACE assay (Frohman et al.,1995) specific to Seq 16 and 

24. The reverse transcription step was carried out using 10 ng of 
mRNAs extracted from salivary gland of engorged ticks. The 
brightest bands represent the cDNA fragments corresponding to the 3' 
end of the targeted mRNA. The amplified products were subjected to 

35 agarose gel electrophoresis followed by staining the DNA fragments ' 
by ethidium bromide. Molecular weight markers (M) were the Smart 
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ladder (Life technologies, Rockville, Maryland, USA). Arrows 
indicate the position of the expected amplified products. 

5 

Figure 2. 

Differential expression analysis of the 5 full-length 
selected cDNAs and 9 cDNA fragment isolated in the subtractive 

10 library. PCR assays were carried out using as DNA template cDNAs 
obtained from a reverse transcription procedure on mRNAs extracted 
from salivary glands either of engorged (E) or of unfed (UF) ticks. 
These RNA messengers were also used as template in reverse 
transcription assays. Ten microliter of both PCR and RT-PCR mixture 

15 were subjected to agarose gel electrophoresis and ethidium bromide 
staining for the detection of amplified DNA products. [++] strongly 
positive; [+] positive; [-] negative. 

20 

DESCRIPTION OF THE INVENTION. 

We have characterised genes which are induced in the 
25 salivary glands of Ixodes ricinus during the slow-feeding phase of the 
blood meal. The cloning of these genes was carried out by setting up 
two complementary DNA (cDNA) libraries. The first one is a 
subtractive library based on the methodology described by Lisitsyn ei 
al. {Science 259 , 946-951,1993) and improved by Diatchenko ei al. 
30 (Proc. Natl. Acad. ScL USA 93, 6025-6030, 1996). This library cloned 
selectively induced mRNA during the tick feeding phase. The second 
library is a full-length cDNA library which has been constructed by 
using the basic property of mRNAs (presence of a polyA tail in its 
3'end and the cap structure in its 5 f end). This cDNA library permitted 
35 the cloning of full-length cDNAs, corresponding to some incomplete 
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cDNA sequences deemed of interest, and identified in the subtractive 
cDNA library. 

The subtractive library was set up by subtracting 
uninduced-cDNAs (synthesised from mRNAs expressed in the salivary 
5 glands of unfed ticks) from induced-cDNAs (synthesised from mRNAs 
expressed in the salivary gland at the end of the slow-feeding phase). 
The induced-cDNAs was digested by a restriction enzyme, divided 
into two aliquots, and distinctively modified by the addition of 
specific adapters. As for the induced-cDNAs, the uninduced cDNAs 

10 was also digested by the same restriction enzyme and then mixed in 
excess to each aliquot of modified induced-cDNA. Each mixture of 
uninduced-/induced-cDNAs was subjected to a denaturation step, 
immediately followed by an hybridisation step, leading to a capture of 
homologous induced-cDNAs by the uninduced-cDNA. Each mixture 

15 was then mixed together and subjected again to a new ; 
denaturation/hybridisation cycle. Among the hybridised cDNA 
molecules, this final mixture comprises induced-cDNAs with different 
adapters at their 5* and 3 1 end. These relevant cDNAs were amplified 
by polymerase chain reaction (PCR), using primers specific to each 

20 adapter located at each end of the cDNA molecules. The PCR 

products were then ligated into the pCRII™ vector by A-T cloning and 
cloned in an TOP-10 E. coli strain. The heterogeneity of this 
subtractive library was evaluated by sequencing the recombinant 
clones. The "induced" property of these cDNA sequences was checked 

25 by reverse transcription-PCR (RT-PCR) on mRNA extracted from 

salivary glands of engorged and unfed ticks. Finally, the full-length 
induced-cDNA was obtained by screening the expression library using, 
as a probe, some incomplete induced-cDNAs isolated from the 
subtractive library. These full-length induced DNA molecules were 

30 sequenced and compared to known polypeptide and polynucleotide 
sequences. 

The full-length cDNA library was set up by using the 
strategy developed in the "CapFinder PCR cDNA Library Construction 
Kit" (Clontech). This library construction kit utilises the unique 
35 CapSwitch™ oligonucleotide (patent pending) in the first-strand 

synthesis, followed by a long-distance PCR amplification to generate 
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high yields of full-length, double-stranded cDNAs. All commonly 
used cDNA synthesis methods rely on the ability of reverse 
transcriptase to transcribe mRNA into single stranded DNA in the 
first-strand reaction. However, because the reverse transcriptase 
5 cannot always transcribe the entire mRNA sequence, the 5' ends of 
genes tend to be under-represented in cDNA population. This is 
particularly true for long mRNAs, especially if the first-strand 
synthesis is primed with oligo(dT) primers only, or if the mRNA has a 
persistent secondary structure. Furthermore, the use of T4 DNA 
10 polymerase to generate blunt cDNA ends after second-strand synthesis 
commonly results in heterogeneous 5' ends that are 5-30 nucleotides 
shorter than the original mRNA (D'Alessio, 1988). In the CapFinder 
cDNA synthesis method, a modified oligo(dT) primer is used to prime 
the first-strand reaction, and the CapSwitch oligonucleotide acts as a 
15 short, extended template at the 5' end for the reverse transcriptase. 
When the reverse transcriptase reaches the 5* end of the mRNA, the 
enzyme switches templates and continues replicating to the end of the 
CapSwitch oligonucleotide. This switching in most cases occurs at the 
7-methylguanosine cap structure, which is present at the 5' end of all 
20 eukaryotic mRNAs (Furuichi & Miura, 1975). The resulting full- 
length single stranded cDNA contains the complete 5* end of the 
mRNA as well as the sequence complementary to the CapSwitch 
oligonucleotide, which then serves as a universal PCR priming site 
(CapSwitch anchor) in the subsequent amplification. The CapSwitch- 
25 anchored single stranded cDNA is used directly (without an 

intervening purification step) for PCR. Only those oligo(dT)-primed 
single stranded cDNAs having a CapSwitch anchor sequence at the 5' 
end can serve as templates and be exponentially amplified using the 3* 
and 5' PCR primers. In most cases, incomplete cDNAs and cDNA 
30 transcribed from polyA" RNA will not be recognised by the CapSwitch 
anchor and therefore will not be amplified. 

At the end of these reactions, the full-length cDNA PCR 
products was ligated into the pCRII cloning vector (Invitrogen) and 
used for the transformation of XL2 E. coli strain. The full-length 
35 cDNA library was then screened by using, as a probe, the incomplete 
induced-cDNAs isolated from the subtractive library. 
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Eighty-nine clones of subtractive library were randomly 
sequenced, and their DNA and amino acid translated sequences were 
compared to DNA and protein databases. Among these, 27 distinct 
family sequences were identified, and 3 of them were selected for 
5 further characterisation of their corresponding full-length mRNA 
sequence. These 3 sequences matched the sequence of i) the human 
tissue factor pathway inhibitor (TFPI), ii) the human thrombin 
inhibitor gene, and iii) a snake venom zinc dependant 
metallopeptidase protein. These genes encode proteins which could be 

10 involved in the inhibition of the blood coagulation. The other 24 

family sequences presented low or no homologies with polynucleotide 
and polypeptide sequences existing in databases. Screening of Ahe 
full-length cDNA library using oligonucleotide probes specific to the 
3 previously selected subtractive clones leJ^d to the recovery of the 

15 corresponding full-length cDNAs. Random screening of this library ■ 
led to the selection of 2 other clones. One is closely homologous to 
an interferon-like protein, whereas the other shows homologies to the 
Rattus norvegicus leucocyte common antigen related protein. 
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Definitions. 

"Putative anticoagulant, anti-complementary and 
immunomodulatory" polypeptides refer to polypeptides having the 
amino acid sequence encoded by the genes defined in the table. These 
present homologies with anticoagulant, anti-complementary and 
immunomodulatory polypeptides already existing in databases. These 
polypeptides belong to the Class I and Class II sequences (see table) 

"Putative anticoagulant, anti-complementary and 
immunomodulatory" cDNAs refer to polynucleotides having the 
nucleotide sequence defined in the table, or allele variants thereof 
and/or their complements. These present homologies with 
anticoagulant, anti-complementary and immunomodulatory 
polynucleotides already existing in databases. These cDNAs belong to 
the Class I and Class II sequences (see table) 

Some polypeptide or polynucleotide sequences present 
low or no homologies with already existing polypeptides or 
polynucleotides in databases. These belong to the Class III (see 
table). 

« Polypeptide » refers to any peptide or protein 
comprising two or more amino acids joined to each other by peptide 
bonds or modified peptide bonds, i.e., peptide isosteres. "Polypeptide" 
refers to both short chains, commonly referred to as peptides, 
oligopeptides or oligomers, and to longer chains, generally referred to 
as proteins. Polypeptides may contain amino acids other than the 20 
gene-encoded amino acids. "Polypeptides" include amino acid 
sequences modified either by natural processes, such as 
posttranslational processing, or by chemical modification techniques 
which are well known in the art. Such modifications are well 
described in basic texts and in more detailed monographs, as well as 
in a voluminous research literature. Modifications can occur 
anywhere in a polypeptide, including the peptide backbone, the amino 
acid side-chains and the amino or carboxyl termini. It will be 
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appreciated that the same type of modification may be present in the 
same or varying degrees at several sites in a given polypeptide. Also, 
a given polypeptide may contain many types of modifications. 
Polypeptides may be branched as a result of ubiquitination, and they 
5 may be cyclic, with or without branching. Cyclic, branched and 

branched cyclic polypeptides may result from posttranslational natural 
processes or may be made by synthetic methods. Modifications 
include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a hem moiety, covalent 

10 attachment of a nucleotide or nucleotide derivative, covalent 

attachment of a lipid or lipid derivative, covalent attachment of 
phosphotidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-linkings, 
formation of cystine, formation of pyroglutamate, formylation, 

15 gamma-carboxylation, glycosylation, GPI anchor formation, 

hydroxylation, iodination, methylation, myristoylation, oxidation, 
proteolytic processing, phosphorylation, prenylation, racemization, 
-selenoylation, sulfation, transfer-RNA mediated addition of amino of 
amino acids to proteins such as arginylation, and ubiquitination. See, 

20 for instance, PROTEINS - STRUCTURE AND MOLECULAR 

PROPERTIES, 2 nd Ed., T. E. Creighton, W. H. Freeman and Comany, 
New York, 1993 and Wolt, F., Posttranslational Protein Modifications 
: Perspectives and Prospects, pgs. 1-12 in POSTTRANSLATIONAL 
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., 

25 Academic Press, New York, 1983; Seifter et aL, "Analysis for protein 
modifications and nonprotein cofactors", Meih Enzymol (1990). 182 : 
626-646 and Rattan et al, "Protein Synthesis : Posttranslational 
Modifications and Aging", Ann NY Acad Sci (1992) 663 : 48-62. 

30 "Polynucleotide'' generally refers to any 

polyribonucleotide or polydeoxyribonucleotide, which may be 
unmodified RNA or DNA or modified RNA or DNA. 
"Polynucleotides" include, without limitation single-and double- 
stranded DNA, DNA that is a mixture of single- and double-stranded 

35 regions, single- and double- stranded RNA, and RNA that is a mixture 
of single- and double-stranded regions, hybrid molecules comprising 
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DNA and RNA that may be single-stranded or, more typically, double 
stranded or a mixture of single- and double-stranded regions. In 
addition, "Polynucleotide" refers to triple-stranded regions comprisin 
RNA or DNA or both RNA and DNA. The term Polynucleotide also 
includes DNAs or RNAs containing one or more modified bases and 
DNAs or RNAs with backbones modified for stability or for other 
reasons. "Modified" bases include, for example, tritylated bases and 
unusual bases such as inosine. A variety of modifications has been 
made to DNA and RNA; thus, "Polynucleotide" embraces chemically, 
enzymatically or metabolically modified forms of polynucleotides as 
typically found in nature, as well as the chemical forms of DNA and 
RNA characteristic of viruses and cells. "Polynucleotide" also 
embraces relatively short polynucleotides, often referred to as 
oligonucleotides. 

"Variant" as the term is used herein, is a Polynucleotide 
or polypeptide that differs from a reference polynucleotide or 
polypeptide respectively, but retains essential properties. A typical 
variant of a polynucleotide differs in nucleotide sequence from 
another, reference polynucleotide. Changes in the nucleotide 
sequence of the variant may or may not alter the amino acid sequence 
of a polypeptide encoded by the reference polynucleotide. Nucleotide 
changes may result in amino acid substitutions, additions, deletions, 
fusions and truncations in the polypeptide encoded by the reference 
sequence, as discussed below. A typical variant of a polypeptide 
differs in amino acid sequence from another, reference polypeptide. 
Generally, differences are limited so that the sequences of the 
reference polypeptide and the variant are closely similar overall and, 
in many regions identical. A variant and reference polypeptide may 
differ in amino acid sequence by one or more substitutions (preferably 
conservative), additions, deletions in any combination. A substituted 
or inserted amino acid residue may or may not be one encoded by the 
genetic code. A variant ofa polynucleotide or polypeptide may be a 
naturally occurring such as an allelic variant, or it may be a variant 
that is not known to occur naturally. Non-naturally occurring variants 
of polynucleotides and polypeptides may be made by mutagenesis 
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techniques or by direct synthesis. Variants should retain one or more 
of the biological activities of the reference polypeptide. For instance, 
they should have similar antigenic or immunogenic activities. as the 
reference polypeptide. Antigenicity can be tested using standard 
immunoblot experiments, preferably using polyclonal sera against the 
reference polypeptide. The immunogenicity can be tested by 
measuring antibody responses (using polyclonal sera generated against 
the variant polypeptide) against purified reference polypeptide in a 
standard ELISA test. Preferably, a variant would retain all of the 
above biological activities. 

"Identity" is a measure of the identity of nucleotide 
sequences or amino acid sequences. In general, the sequences are 
aligned so that the highest order match is obtained. "Identity" per se 
has an art-recognized meaning and can be calculated using published 
techniques. See, e.g. : (COMPUTATIONAL MOLECULAR 
BIOLOGY, Lesk, A.M., ed., Oxford University Press,- New York, 
1988; BIOCOMPUTING : INFORMATICS AND GENOME 
PROJECTS, Smith, D.W., ed., Academic Press, New York, 1993; 
COMPUTER ANALYSIS OF SEQUENCE DATA, PART I, Griffin, 
A.M., and Griffin, H.G., eds, Humana Press, New Jersey, 1994; 
SEQUENCE ANALYSIS IN MOLECULAR BIOLOGY, von Heijne, G^ 
Academic Press, 1987; and SEQUENCE ANALYSIS PRIMER, 
Gribskov, M. and Devereux, J., eds, M Stockton Press, New York, 
1991). While there exist a number of methods to measure identity 
between two polynucleotide or polypeptide sequences, the term 
"identity" is well known to skilled artisans (Carillo, H., and Lipton, 
D., SI AM J Applied Math (1998) 48 : 1073). Methods commonly 
employed to determine identity or similarity between two sequences 
include, but are not limited to, those disclosed in Guide to Huge 
Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, 
and Carillo, H., and Lipton, D., SIAM J Applied Math (1988) 48 : 
1073. Methods to determine identity and similarity are codified in 
computer programs. Preferred computer program methods to determine 
identity and similarity between two sequences include, but are not 
limited to, GCG program package (Devereux, J., et aL y J Molec Biol 
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(1990) 215 : 403). Most preferably, the program used to determine 
identity levels was the GAP program, as was used in the Examples 
below. 

5 As an illustration, by a polynucleotide having a 

nucleotide sequence having at least, for example, 95% "identity" to a 
reference nucleotide sequence is intended that the nucleotide sequence 
of the polynucleotide is identical to the reference sequence except that 
the polynucleotide sequence may include an average up to five point 

10 mutations per each 100 nucleotides of the reference nucleotide 
sequence. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference nucleotide 
sequence, up to 5% of the nucleotides in the reference sequence may 
be deleted or substituted with another nucleotide, or a number of 

15 nucleotides up to 5% of the total nucleotides in the reference sequence 
may be inserted into the reference sequence. These mutations of the 
reference sequence may occur at the 5' or 3 5 terminal positions of the 
reference nucleotide sequence or anywhere between those terminal 
positions, interspersed either individually among nucleotides in the 

20 reference sequence or in one or more contiguous groups within the 
reference sequence. 

Polypeptides of the invention. 

25 The present invention relates to proteins (or polypeptides) 

secreted by /. ricinus salivary glands. These polypeptides include the 
polypeptides encoded by the cDNAs defined in the table; as well 
polypeptides comprising the amino acid sequence encoded by the 
cDNAs defined in the table; and polypeptides comprising the amino 

30 acid sequence which have at least 75 % identity to that encoded by the 
cDNAs defined in the table over their entire length, and preferably at 
least 80 % identity, and more preferably at least 90 % identity. Those 
with 95-99 % are highly preferred. 

35 The /. ricinus salivary gland polypeptides may be in the 

form of the "mature" protein or may be a part of a larger protein such 
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as a fusion protein. It may be advantageous to include an additional 
amino acid sequence which contains secretory or leader sequences, 
pro-sequences, sequences which help in purification such as multiple 
histidine residues, or an additional sequence for stability during 
5 recombinant production. 

Fragments of the /. ricinus salivary gland polypeptides 
are also included in the present invention. A fragment is a 
polypeptide having an amino acid sequence that is the same as part, 

10 but not all, of the amino acid sequence of the aforementioned /. 

ricinus salivary gland polypeptides. As with /. ricinus salivary gland 
polypeptides, fragment may be "free-standing 5 ' or comprised within a 
larger polypeptide of which they form a part or region, most 
preferably as a single continuous region. Representative examples of 

15 polypeptide fragments of the invention, include, for example, 

fragments from about amino acid number 1-20, 21-40, 41-60, 61-80, 
81-100, and 101 to the end of the polypeptide. In this context "about" 
includes the particularly recited ranges larger or smaller by several, 5, 
4, 3, 2 or 1 amino acid at either extreme or at both extremes. 

20 

Preferred fragments include, for example, truncation 
polypeptides having the amino acid sequence of the /. ricinus salivary 
gland polypeptides, except for deletion of a continuous series of 
residues that includes the amino terminus, or a continuous series of 

25 residues that includes the carboxyl terminus and / or transmembrane 
region or deletion of two continuous series of residues, one including 
the amino terminus and one including the carboxyl terminus. Also 
preferred are fragments characterised by structural or functional 
attributes such as fragments that comprise alpha-helix and alpha-helix 

30 forming regions, beta-sheet and beta-sheet forming regions, turn and 
turn-forming regions, coil and coil-forming regions, hydrophilic 
regions, hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surface-forming regions, 
substrate binding region, and high antigenic index regions. Other 

35 preferred fragments are biologically active fragments. Biologically 
active fragments are those that mediate /. ricinus salivary gland 
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protein activity, including those with a similar activity or an improved 
activity, or with a decreased undesirable activity. Also included are 
those that antigenic or immunogenic in an animal or in a human. 

5 Preferably, all of these polypeptide fragments retain parts 

of the biological activity (for instance antigenic or immunogenic) of 
the /. ricinus salivary gland polypeptides, including antigenic activity. 
Variants of the defined sequence and fragments also form part of the 
present invention. Preferred variants are those that vary from the 

10 referents by conservative amino acid substitutions - i.e., those that 

substitute a residue with another of like characteristics. Typical such 
substitutions are among Ala, Val, Leu and He; among Ser and Thr; 
among the acidic residues Asp and Glu; among Asn and Gin; and 
among the basic residues Lys and Arg; or aromatic residues Phe and 

15 Tyr. Particularly preferred a^re variants in which several, 5-10, 1-5, 
or 1-2 amino acids are substituted, deleted, or added in any 
combination. Most preferred variants are naturally occurring allelic 
variants of the /. ricinus salivary gland polypeptide present in /. 
ricinus salivary glands. 

20 

The /. ricinus salivary gland polypeptides of the invention 
can be prepared in any suitable manner. Such polypeptides include 
isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides 
25 produced by a combination of these methods. Means for preparing 
such polypeptides are well understood in the art. 

Polynucleotides of the invention, 

30 Another aspect of the invention relates to /. ricinus 

salivary gland cDNAs (polynucleotides). These include isolated 
polynucleotides which encode /. ricinus salivary gland polypeptides 
and fragments respectively, and polynucleotides closely related 
thereto. More specifically, /. ricinus salivary gland cDNAs of the 

35 invention include a polynucleotide comprising the nucleotide sequence 
of cDNAs defined in the table, encoding a /. ricinus salivary gland 
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polypeptide. The /. ricinus salivary gland cDNAs further include a 
polynucleotide sequence that has at least 75% identity over its entire 
length to a nucleotide sequence encoding the /. ricinus salivary gland 
polypeptide encoded by the cDNAs defined in the table, and a 
5 polynucleotide comprising a nucleotide sequence that is at least 75% 
identical to that of the cDNAs defined in the table. In this regard, 
polynucleotides at least 80% identical are particularly preferred, and 
those with at least 90% are especially preferred. Furthermore, those 
with at least 95% are highly preferred and those with at least 98-99% 

10 are most highly preferred, with at least 99% being the most preferred, 
with at least 99% being the most preferred. Also included under /. 
ricinus salivary gland cDNAs is a nucleotide sequence which has 
sufficient identity to a nucleotide sequence of a cDNA defined in the 
table to hybridize under conditions useable for amplification or for 

15 use as a probe or marker. The invention also provides polynucleotides 
which are complementary to such /. ricinus salivary gland cDNAs. 

The nucleotide sequence encoding /. ricinus salivary 
gland polypeptide encoded by the cDNAs defined in the table may be 
20 identical to the polypeptide encoding sequence contained in the genes 
defined in the table, or it may be a sequence, which as a result of the 
redundancy (degeneracy) of the genetic code, also encodes the 
polypeptide encoded by the genes defined in the table respectively. 

25 When the polynucleotides of the invention are used for 

the recombinant production of an /. ricinus salivary gland polypeptide, 
the polynucleotide may include the coding sequence for the mature 
polypeptide or a fragment thereof, by itself; the coding sequence for 
the mature polypeptide or fragment in reading frame with other coding 

30 sequences, such as those encoding a leader or secretory sequence, a 
pre-, or pro-or preproprotein sequence, or other fusion peptide 
portions. For example, a marker sequence which facilitates 
purification of the fused polypeptide can be encoded. In certain 
preferred embodiments of this aspect of the invention, the marker 

35 sequence is a hexa-histidine peptide, as provided in the pQE vector 
(Qiagen, Inc.) and described in Gentz et al, Proc Natl Acad Sci USA 
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(1989) 86:821-824, or is an HA tag, or is glutathione-s-transferase. 
The polynucleotide may also contain non-coding 5' and 3' sequences, 
such as transcribed, non-translated sequences, splicing and 
polydenylation signals, ribosome binding sites and sequences that 
5 stabilize mRNA. 

Further preferred embodiments are polynucleotides 
encoding /. ricinus salivary gland protein variants comprising the 
amino acid sequence of the /. ricinus salivary gland polypeptide 

10 encoded by the cDNAs defined by the table respectively in which 
several, 10-25, 5-10, 1-5, 1-3, 1-2 or 1 amino acid residues are 
substituted, deleted or added, in any combination. Most preferred 
variant polynucleotides are those naturally occurring /. ricinus 
sequences that encode allelic variants of the /. ricinus salivary gland 

15 proteins in /. ricinus. 

The present invention further relates to polynucleotides 
that hybridize to the herein above-described sequences. In this regard, 
the present invention especially relates to polynucleotides which 
20 hybridize under stringent conditions to the herein above-described 
polynucleotides. As herein used, the term "stringent conditions" 
means hybridisation will occur only if there is at least 80%, and 
preferably at least 90%, and more preferably at least 95%, yet even 
more preferably 97-99% identity between the sequences. 

25 

Polynucleotides of the invention which are identical or 
sufficiently identical to a nucleotide sequence of any gene defined in 
the table or a fragment thereof, may be used as hybridisation probes 
for cDNA clones encoding /. ricinus salivary gland polypeptides 

30 respectively and to isolate cDNA clones of other genes (including 
cDNAs encoding homologs and orthologs from species other than /. 
ricinus) that have a high sequence similarity to the /. ricinus salivary 
gland cDNAs. Such hybridisation techniques are known to those of 
skill in the art. Typically these nucleotide sequences are 80% 

35 identical, preferably 90% identical, more preferably 95% identical to 
that of the referent. The probes generally will comprise at least 15 



19 



nucleotides, preferably, such probes will have at least 30 nucleotides 
and may have at least 50 nucleotides. Particularly preferred probes 
will range between 30 and 50 nucleotides. In one embodiment, to 
obtain a polynucleotide encoding /. ricinus salivary gland polypeptide, 
5 including homologs and orthologs from species other than /. ricinus, 
comprises the steps of screening an appropriate library under stringent 
hybridisation conditions with a labelled probe having a nucleotide 
sequence contained in one of the gene sequences defined by the table, 
or a fragment thereof; and isolating full-length cDNA clones 

10 containing said polynucleotide sequence. Thus in another aspect, /. 

ricinus salivary gland polynucleotides of the present invention further 
include a nucleotide sequence comprising a nucleotide sequence that 
hybridise under stringent condition to a nucleotide sequence having a 
nucleotide sequence contained in the cDNAs defined in the table, or a 

15 fragment thereof. Also included with /. ricinus salivary gland 
polypeptides are polypeptides comprising amino acid sequences 
encoded by nucleotide sequences obtained by the above hybridisation ■ 
conditions. Such hybridisation techniques are well known to those of 
skill in the art. Stringent hybridisation conditions are as defined above 

20 .or, alternatively, conditions under overnight incubation at 42°C in a 
solution comprising : 50% formamide, 5xSSC (150mM NaCl, 15mM - 
trisodium citrate), 50mM sodium phosphate (pH7.6), 5x Denhardt's ;z 
solution, 10% dextran sulfate, and 20 microgram/ml denatured, 
sheared salmon sperm DNA, followed by washing the filters in 

25 O.lxSSC at about 65°C. 



The polynucleotides and polypeptides of the present 
invention may be employed as research reagents and materials for 
discovery of treatments and diagnostics to animal and human disease. 

30 

Diagnostic Assays 

This invention also relates to the use of /. ricinus salivary 
gland polypeptides, or /. ricinus salivary gland polynucleotides, for 
35 use as diagnostic reagents. 
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Materials for diagnosis may be obtained from a subject's 
cells, such as from blood, urine, saliva, tissue biopsy. 

Thus in another aspect, the present invention relates to a 

c f \ 

5 diagnostic kit for a disease or sus^e^t5,bility to a disease which 
comprises : 

(a) an /. ricinus salivary gland polynucleotide, preferably 
the nucleotide sequence of one of the gene sequences defined by the 
table, or a fragment thereof; 
10 (b) a nucleotide sequence complementary to that of(a); 

(c) an /. ricinus salivary gland polypeptide, preferably the 
polypeptide encoded by one of the gene sequences defined in the 
table, or a fragment thereof; 

(d) an antibody to an /. ricinus salivary gland 

15 polypeptide, preferably to the polypeptide encoded by one of the gene 
sequences defined in the table; or 

(e) a phage displaying an antibody to an /. ricinus 
salivary gland polypeptide, preferably to the polypeptide encoded by 
one of the cDNAs sequences defined in the table. 

20 



It will be appreciated that in any such kit, (a), (b), (c), (d) 
or (e) may comprise a substantial component. 



The anti-7. ricinus salivary gland polypeptide antibodies may comprise polyclonal antibodies. 
Methods of preparing polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies 
can be raised in a mammal for example, by one or more injections of an immunizing agent and, if 
desired, an adjuvant. Typically, the immunizing agent and/or adjuvant will be injected in the 
mamma! by multiple subcutaneous or intraperitoneal injections. The immunizing agent may include 
the /. ricinus salivary gland polypeptide or a fusion protein thereof. It may be useful to conjugate the 
immunizing agent to a protein known to be immunogenic in the mammal being immunized. 
Examples of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, 
serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which 
may be employed include Freund's complete adjuvant and MPL TDM adjuvant. The immunization 
protocol may be selected by one skilled in the art without undue experimentation. 

The anti-7. ricinus salivary gland polypeptide antibodies may, alternatively, be monoclonal 
antibodies. Monoclonal antibodies may be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature. 256:495 (1975). In a hybridoma method, a mouse, 
hamster, or other appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to the "~ 
immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. 
The immunizing agent will typically include the I. ricinus salivary gland polypeptide or a fusion 
protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human . 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources are 
desired. The lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, 
such as polyethylene glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles 
and Practice Academic Press, (1986) pp. 59-103]. Immortalized cell lines are usually transformed 
mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, rat or 
mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture 
medium that preferably contains one or more substances that inhibit the growth or survival of the 
unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will 
include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the 
growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression 
of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT 
medium. More preferred immortalized cell lines are murine myeloma lines, which can be obtained, 
for instance, from the Salk Institute Cell Distribution Center, San Diego, California and the American 
Type Culture Collection, Rockville, Maryland. Human myeloma and mouse-human heteromyeloma 
cell lines also have been described for the production of human monoclonal antibodies [Kozbor, J 
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Immunol , 133 :3001 (1984); Brodeur et al., Monoclonal Antibody Production Techniques and . 
Applications Marcel Dekker, Inc., New York, (1987) pp. 51-631. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence 
of monoclonal antibodies directed against / ricinus salivary gland polypeptide. Preferably, the 
binding specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (R1A) or enzyme- 
linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the art. The 
binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard 
analysis of Munson and Pollard, Anal Biochem ., 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 
procedures and grown by standard methods [Goding, supra] . Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively' 
the hybridoma cells may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture 
medium or ascites fluid by conventional immunoglobulin purification procedures such as, for 
example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or 
affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those 
described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention 
can be readily isolated and sequenced using conventional procedures (e.g., by using oligonucleotide 
probes that are capable of binding specifically to genes encoding the heavy and light chains of 
murine antibodies). The hybridoma cells of the invention serve as a preferred source of such DNA. 
Once isolated, the DNA may be placed into expression vectors, which are then transfected into host 
cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not 
otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. The DNA also may be modified, for example, by substituting the coding 
sequence for human heavy and light chain constant domains in place of the homologous murine 
sequences [U.S. Patent No. 4,8 1 6,567; Morrison et al., supra ] or by covalently joining to the 
immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin 
polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant domains of 
an antibody of the invention, or can be substituted for the variable domains of one antigen-combining 
site of an antibody of the invention to create a chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are 
well known in the art. For example, one method involves recombinant expression of immunoglobulin 
light chain and modified heavy chain. The heavy chain is truncated generally at any point in the Fc 
region so as to prevent heavy chain crosslinking. Alternatively, the relevant cysteine residues are 
substituted with another amino acid residue or are deleted so as to prevent crosslinking. 
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In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly, Fab fragments, can be accomplished using routine 
techniques known in the art. 

As an alternative or supplement to immunising a mammal with a peptide, an antibody specific for a 
protein may be obtained from a recombinantly produced library of expressed immunoglobulin 
variable domains, e.g. using lambda bacteriophage or filamentous bacteriophage which display 
functional immunoglobulin binding domains on their surfaces; for instance see WO92/01047. The 
library may be naive, that is constructed from sequences obtained from an organism which has not 
been immunised with any of the proteins (or fragments), or may be one constructed using sequences 
obtained from an organism which has been exposed to the antigen of interest. 



Vaccines. 

Another aspect of the invention relates to a method for 
inducing an immunological response in a mammal which comprises - 
inoculating the mammal with /. ricinus salivary gland polypeptide or 
epitope-bearing fragments, analogs, outer-membrane vesicles or cells 
(attenuated or otherwise), adequate to produce antibody and/or T cell 
immune response to protect said animal from bacteria and viruses 
which could be transmitted during the blood meal of I. ricinus and 
related species. In particular the invention relates to the use of /. 
ricinus salivary gland polypeptides encoded by the cDNAs defined in 
the table. Yet another aspect of the invention relates to a method of 
inducing immunological response in a mammal which comprises, 
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delivering /. ricinus salivary gland polypeptide via a vector directing 
expression of /. ricinus salivary gland polynucleotide in vivo in order 
to induce such an immunological response to produce antibody to 
protect said animal from diseases (Lyme disease, tick encephalitis 
5 virus disease, ). 

A further aspect of the invention relates to an 
immunological composition or vaccine formulation which, when 
introduced into a mammalian host, induces an immunological response 

10 in that mammal to a /. ricinus salivary gland polypeptide wherein the 
composition comprises a /. ricinus salivary gland cDNA, or /. ricinus 
salivary gland polypeptide or epitope-bearing fragments, analogs, 
outer-membrane vesicles or cells (attenuated or otherwise), the 
vaccine formulation may further comprise a suitable carrier. The /. 

15 ricinus salivary gland polypeptide vaccine composition is preferably 
administered orally or parenterally (including subcutaneous, 
intramuscular, intravenous, intradermal etc. injection). Formulations 
suitable for parenteral administration include aqueous and non- 
aqueous sterile injection solutions which may contain anti-oxidants, 

20 buffers, bacteriostats and solutes which render the formulation iotonic 
with the blood of the recipient; and aqueous and non-aqueous sterile 
suspensions which may include suspending agents or thickening 
agents. The formulations may be presented in unit-dose or multi-dose 
containers, for example; sealed ampoules and vials and may be stored 

25 in a freeze-dried condition requiring only the addition of the sterile 
liquid carrier immediately prior to use. The vaccine formulation may 
also include adjuvant systems for enhancing the immunogenicity to the 
formulation, such as oil-in water systems and other systems konwn in 
the art. The dosage will depend on the specific activity of the vaccine 

30 and can be readily determined by routine experimentation. 

Yet another aspect relates to an immunological/vaccine 
formulation which comprises the polynucleotide of the invention. 
Such techniques are known in the art, see for example Wolff et al t 
35 Sciences, (1990) 247 : 1465-8. 
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Therapeutics. 

Another aspect of the invention relates to the use of these 
/. ricinus salivary gland polypeptides as therapeutics agent. In 
5 considering the particular potential therapeutic areas for the likely 
products, the hospital disciplines cover by these products are : 
hematology (particularly coagulation clinics), transplantation (for 
immunosuppression control), rheumatology (for anti-inflammatories), 
and general treatment (for specific or improved anesthetics). 



10 



Table: Sequences identified in the subtractive and the cDNA full-length librarie: 
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EXAMPLES 

Biological materials used in this study. 

The salivary glands of 5 day engorged or unfed free of 
pathogen Ixodes ricinus female adult ticks were used in this work. 
When removed, these glands were immediately frozen in liquid 
nitrogen and stored at -80°C. To extract RNA messengers (mRNA), 
the salivary glands were crushed in liquid nitrogen using a mortar and 
a pestle. The mRNAs were purified by using an oligo-dT cellulose 
(Fast Track 2.0 kit, Invitrogen, Groningen, The Netherlands). Two 
micrograms of mRNAs were extracted from 200 salivary glands of fed 
ticks; and 1.5 fig of mRNAs were also extracted from 1,000 salivary 
glands of unfed ticks. 

Example 1 : Construction of a Representational 
Difference Analysis (RDA) subtractive library. 

All procedures were performed as described by Hubank • 
and Schatz (1994). Double-stranded cDNAs were synthesised using 
the Superscript Choice System (Life Technologies, Rockville, 
Maryland, USA). The cDNAs were digested with DpnII restriction 
enzyme, ligated to R-linkers, amplified with R-24 primers (Hubank 
and Schatz, 1994), and finally digested again with the same enzyme to 
generate a "tester" pool consisting of cDNAs from salivary glands of 
fed ticks and a "driver" pool consisting of cDNAs from salivary 
glands of unfed ticks. The first round of the subtractive hybridisation 
process used a tester/driver ratio of 1:100. The second and third 
rounds utilised a ratio of 1:400 and 1:200,000, respectively. After 
three cycles of subtraction and amplification, the DpnII-digested 
differential products were subdivided according to size into 4 different 
fractions on a 1.7% electrophoresis agarose gel, and subcloned into 
the BamHI site of the pTZ19r cloning vector. The ligated product was 
used to transform TOP-10 E. coli competent cells (Invitrogen, 
Groningen, The Netherlands). Nine thousand six hundred clones of 
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this subtractive library were randomly selected, and individually put 
in 100 microplates and stored at -80°C. This subtractive library was 
analysed by sequencing 89 randomly chosen clones, using M13 
forward and reverse primers specific to a region located in the pT19r 
5 cloning vector. The DNA sequences of these 89 clones were compared, 
and 27 distinct family sequences were identified. Homology of these 
sequences to sequences existing in databases is presented in Table 1. 
The subtractive sequences 1 to 27 are presented in the sequence-listing 
file (except for sequences 16 and 24 whose complete mRNA sequences 

10 are presented; see section, Example 2). Three sequences (Seq 7, 16 
and 24) were selected for further characterisation of their 
corresponding full-length mRNA sequence. These 3 sequences 
matched the sequence of i) the human tissue factor pathway inhibitor 
(TFPI), ii) a snake venom zinc dependant metallopeptidase protein, 

15 and iii) the human thrombin inhibitor protein, corresponding to Seq 7, 
16 and 24, respectively. These genes encode proteins which could be 
involved in the inhibition of the blood coagulation. The other 23 
family sequences presented low or no homologies with polynucleotide 
and polypeptide sequences existing in databases 

20 

Example 2 : Construction of the full length cDNA library 
and recovery of full length cDNAs sequences by screening of this full 
length cDNA library. 

25 

This library was set up using mRNAs extracted from 
salivary glands of engorged ticks. The mRNAs (80 ng) were subjected 
to reverse transcription using a degenerated oligo-dT primer (5'- 
A(T)30VN-3'), the SmartTM oligonucleotide (Clontech, Palo Alto, 

30 USA), and tthe Superscript II reverse transcriptase (Life Technologies, 
Rockville, Maryland, USA). The single strand cDNA mixture was used 
as template in a hot start PGR assay including the LA Taq polymerase 
(Takara, Shiga, Japan), the modified oligo-dT primer and a 3 '-'Smart' 
primer specific to a region located at the 5' end of the SmartTM 

35 oligonucleotide. The PGR protocol applied was : I min at 95°C, 

followed by 25 sec at 95°C / 5 min at 68°C, 25 times, and 10 min at 
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72°C. The amplified double stranded cDNA mixture was purified with 
a Centricon 30 concentrator (Millipore, Bedford, USA). The cDNAs 
were divided into 4 fractions ranging from 0.3 to 0.6 kb, 0.6 to 1 kb, 1 
kb to 2 kb and 2 kb to 4 kb on a 0,8% high grade agarose 
electrophoresis gel and recovered separately by using the Qiaex II 
extraction kit (Qiagen, Hilden, Germany). The 4 fractions were ligated 
individually into the pCRII cloning vector included in the TOPO 
cloning kit (Invitrogen, Groningen, The Netherlands). The ligated 
fractions were then used to transform XL2-Blue ultracompetent E. coli 
cells (Stratagene, Heidelburg, Germany). The resulted recombinant 
clones were stored individually in microplates at -80°C. Ten clones 
were randomly chosen for partial or complete sequencing. As a result 
of this procedure, 2 cDNA sequences (Seq 28 and Seq 29, see Table 1) 
were selected for their homology to sequence databases. One is 
closely homologous to an interferon-like protein (Seq 28), whereas th : e 
other shows homologies to the Rattus norvegicus leucocyte common 
antigen-related protein (Seq 29). 

The 4 different fractions of the full-length cDNA library 
were screened with radio-labelled oligonucleotide probes. The 
labelling of these oligo probes was performed as described in Current 
Protocols in Molecular Biology (Ausubel et al, 1995, J. Wiley and ^* 
sons, Eds). These 4 different fractions were then plated on 
nitrocellulose membranes and grown overnight at 37°C. These 
membranes were denatured in NaOH 0.2M / NaCl L5M, neutralised in 
Tris 0.5M pH 7.5-NaCl 1.5M and fixed in 2X SSC (NaCl 0.3 M/ Citric 
Acid Trisodium di-hydrated 0.03 M). The membranes were heated for 
90 min. at 80°C, incubated in a pre-hybridisation solution (SSC 6X, 
Denhardt's 10X, SDS 0,1%) at 55°C for 90 min., and finally put 
overnight in a preheated hybridisation solution containing a specific 
radio-labelled oligonucleotide probe at 55 °C. The hybridised 
membranes" were washed 3 times in a SSC 6X solution at 55 °C for 10 
min, dried and exposed on Kodak X-OMAT film overnight at -80°C. 
In this way, the complete consensus mRNA sequence of the Seq 28 
and 29 was confirmed by the recovery of two other clones 
corresponding to these sequences. Only one full-length cDNA clone 
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corresponding to the subtractive clone 16 was isolated. Therefore, to 
identify the complete sequence of the Seq 16 and 24, the Rapid 
Amplification of cDNA Ends (RACE) method was applied. 

The RACE methodology was performed as described by , 
Frohman et al. (1995). The reverse transcription step was carried out 
using 10 ng of mRNAs extracted from salivary gland of engorged ticks 
and the Thermoscript Reverse transcriptase (Life technologies, 
Rockville, Maryland, USA). All gene specific primers (GSP) had an 
18 base length with a 61% G/C ratio. The amplified products were 
subjected to an agarose gel electrophoresis and recovered by using an. 
isctachophorese procedure. The cDNAs were cloned into the pCRII- 
TOPO cloning vector (Invitrogen, Groningen, The Netherlands). To 
identify the consensus cDNA sequence,, different clones were 
sequenced., and their sequence was compared to their known 
corresponding sequence. Therefore, the complete cDNA sequences of 
the clones 16 and 24 isolated in the subtractive library were obtained 
by this RACE procedure (figure 1). 

Example 4 ; Evaluation of the differential expression of 
the cDNA clones isoalted in the subtractive and full-length cDNA . 
libraries. 

The differential expression of the mRNAs corresponding 
to the 5 full-length selected clones (Seq 7, 16, 24, 28 and 29) and of 9 
subtractive clones was assessed using a PCR and a RT-PCR assays 
(figure 2). 

The PCR assays were carried out using as DNA template 
cDNAs obtained from a reverse transcription procedure on mRNAs 
extracted from salivary glands either of engorged or of unfed ticks. 
Each PCR assay included pair of primers specific to each target 
subtractive or cDNAs full-length sequence. PCR assays were 
performed in a final volume of 50 ^1 containing 1 primers, 0.2 mM 
deoxynucleotide (dATP, dCTP, dGTP and dTTP; Boehringer 
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Mannheim GmbH, Mannheim, Germany), PCR buffer (10 mM Tris- 
HC1, 50 mM KC1 , 2.5 mM. MgC12 , pH 8.3) and 2.5 U of Taq DNA 
polymerase (Boehringer mannheim GmbH, Mannheim, Germany). 
DNA samples were amplified for 35 cycles under the following 
5 conditions : 94°C for 1 min., 72 °C for 1 min. and 64 °C for 1 min, 
followed by a final elongation step of 72 °C for 7 min. 

The RT-PCR assay was carried out on the 5 selected full- 
length cDNA clones and on 5 cDNA subtractive clones. The mRNAs 

10 used as template in the reverse transcription assay was extracted from 
salivary glands of engorged and unfed I. ricinus ticks. The reverse 
transcription assays were performed using a specific primer (that 
target one the selected sequences) and the "Thermoscript Reverse 
transcriptase" (Life technologies, Rockville, Maryland, USA) at 60°C 

15 for 50 min. Each PCR assay utilised the reverse transcription specific" 
primer and an another specific primer. The PCR assays were 
performed in a final volume of 50 jil containing IjiM primers, 0.2 mM 
deoxynucleotide (dATP, dCTP, dGTP and dTTP; Boehringer - 
Mannheim GmbH, Mannheim, Germany), PCR buffer (10 mM Tris- 

20 HC1, 50 mM KC1 , 2.5 mM. MgC12 , pH 8.3) and 2.5 U of Expand 
High Fidelity polymerase (Roche, Bruxelles, Belgium).* Single 
stranded DNA samples were amplified for 3 0 cycles under the fck: 
following conditions : 95°C for 1 min., 72 °C for 30 sec. and 60 °C for 
1 min, followed by a final elongation step of 72 °C for 7 min. The 

25 figure 2 shows that the expression of the selected sequences is induced 
in salivary glands of 5 day engorged ticks, except for the sequence 28 
that is expressed at a similar level in salivary glands of engorged and 
unfed ticks. 
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SEQUENCE LISTING 

( 1 ) INFORMATION FOR SEQ ID N°: 1 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 194 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N° 1 : 

1 ATACCTTCCA CTTGTAGCCC TTCCTCATCC GATATGGTGA CGGATGCCAT 
15 51 TGCATCCTCG TCGTGGAAGA GGTCCTCTTC TAAATAAGAC CCATCCATAT 

101 ATGTGTGTTT GCGAATGCCG TCGACGTAGC TCCTGACTAG AAACTCGTCG 
151 GCTAGGACAG AACTTTTCTT CAGGTTTAGC GTAATGTCCT CGTT 

20 

(2) INFORMATION FOR SEQ ID N°: 2 

25 (i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 607 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

30 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°2 : 

35 1 TACCNGGGAA TCCAAAACCA ATTTTTATTG GAACTTCCAC GTCTTCTTCA 

51 AGGCGGTGGC ACCTCTGCAT TTATGAAGTT CGTCTTGGCA TTTTATTTTT 
101 TGCTTCTTTC ATTGCRGAAC TCGCAAATGC ACTTCCCGTG CTTGTCGCAT 

40 

151 TTCGCCCCAA AAGCGCATGG CATTCCTTCC GGCAGATTAA CTTTTTCAAA 
201 TTCACGGTTC TGAACCAATA ATAGATCGTG GCAATGTTTG TGCTGTTTGC 
45 251 GATTTGCAAA CCAGCTGTAG CCACCATTGG ACTCAAAGGT GCGCACAACA 

301 TGGCGCCGAA CTGTGAAAAA CAAATTAAGG CTNCTTTGTA ATAACGCTAG 
351 TCTTGGTACG CCGTTAGAGG TCGATGTCGC GCCTCGCGAT TGCAAAGTCA 

50 

4 01 CTTGCACTTA TCAAGCTCCT GGAGAAAAAT GGGTGCAACG GGGGGATCAG 
4 51 CGTTTGTACT TGCAAACATT TGTGGAGACG GTAAACCWGT ATTTCGCGGA 
55 5 01 ACTCAGATGC TCCAGCGTGA AGCTCGTCTT AATAAAAGTT GTAAATTCGA 
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551 GTATNGATGA AGAACTGAAA TTCGAGGCAT TTAGAAACAC CACGAGAAGC 
601 AGCGGAA 

5 

(3) INFORMATION FOR SEQ ID N°: 3 

(i) SEQUENCE CHARACTERISTICS : 
10 (a) LENGTH : 259 base pairs 

(b) TYPE : nucleic acid 

(c) STRAND EDNESS : single 

(d) TOPOLOGY : linear 

15 (ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°3 : 

1 GATCCTACGC CTGAAAA.TGA GTGTCCATCG TCTTCACATA GTGCCACATT 

20 

51 GTAATTGGTA CAAGCTCCAT TTTCGTCAGC GCTGTTTGTT A.TGCTGCCGC 
101 CTACTTTTCC TTCGGCACTC CATAAGTTAA ACCCTGTCAT TATAAGTGTG 
25 151 ATTGCCGTAT CTCGGCTGAA TGGGTTCCAT TTTTCTCTTA. AATAATCACG 

201 TGTCCATATT CCATGTATTG TGTTCATGAG TATGTGATTC TCATCGTATA 
251 TCTTCGCCT 

30 

(4) INFORMATION FOR SEQ ID N°: 4 

35 (i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 170 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

40 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°4 : 

45 1 CCACTCGAAA.ATGGAGGCTT TGAAACATTT CAGTACCCCT GTGAACTCTG 

51 GCTTTGCAAT GTAACAGCAA AAACACTTAC AGTTGAAGGG TGCAGTGT'CA 
101 GACGCTATGG AAGTTGCATC CACGAGCACR ACCCTGATTA CTACTGGCCA 

50 

151 CGTTGCTRTC CGGGTCGTCC 
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(5) INFORMATION FOR SEQ ID N°: 5 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 168 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°5 : 

1 GTATGTTACC ATGTCCAACC CGGTTATTAA ATACACCAAG TCGTAGGATT 

51 TGTAGGCAGC TGCATTGCCC TTGACGTACT CTCTCAACGT TGCCAAGGAC 

101 TCAGGCCCAT AAATGTAGTG GGGTTGACCT TGAACTCTTC GTAAAAAGCG 

151 TTCTTTCTCC GTCGTGAG 

(6) INFORMATION FOR SEQ ID N°: 6 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 247 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°6 : 

1 CCGAAMATAA AACTTAGTCT CACCAATATA' CGTTTGCCTA ACGCGAAGGA 

51 AC AG G C AC AA ATATACTACG AG C AC G AC AT TCTCAAGAAC ACGGTTCACG 

101 GAGTGTGGAC GAGAATTCAC TCAAAATATC CGTTCCCTGA AGATGAGGGA 

151 ATTACACTGA TAATGACAGG GTTTGATTTA TGGAGTGCCG ATTTAACTGT 

201 AGGCGGCACC ATAACAAACA GCGCTGAGAA AAGCGGAGCT TGTACGA 
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(7) INFORMATION FOR SEQ ID N°: 7 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 35 1 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°7 : 



\ 


CTTCGTGGTG 


AGCTTAGTCA 


TTGTGGCCTG 


CATCGTGGTA 


GACACAGCCA 




ACCACAAAGG 


TAGAGGGCGG 


CCTGCGAAGT 


GTAAACTTCC 


TCCGGACGAC 


101 


GGACCATGCA 


GAGCACGAAT 


TCCGAGTTAC 


TACTTTGATA 


GAAAAACCAA 


151 


AACGTGCAAG 


GAGTTTATGT 


ATGGCGGATG 


CGAAGGAAAC 


G AAAAC AAT T 


201 


TTGAAAACAT 


AACTACGTGC 


CAAGAGGAAT 


G C AG AG C AAA 


AAAA G T C T AA 


251 


ACCAAGCCTT 


GAATGAAGAC 


TGCCTGAGAT 


CAT T C AAGAA 


GAATCTGTGC 


301 


TCAGTCAAAT 


AAGACAAAAA 


GTCAAATAAT 


AAAATAAG T T 


TAAAAAAATC 


351 


G 











(8) INFORMATION FOR SEQ ID N°: 8 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 292 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°8 : 

1 CATCGMAGCC ATAGTATATT TTGCACTTGT CTTCCGTTTC GTCGTAGTAG 

51 GACCGATTCC AC AT T G TAG T ACACCAGTCA CTTATATCCT GCGGGCGGTG 

101 CTTGCATTTG TCCTGAACAA ATCTTCCACA GCGCTTGTCG CACGCCTCCT 

151 G G G AAT AG AA CGCGTTCTCT CCTCCGCATC TCCATTTGGA AT CAT AG AAA 

201 CATCTTTCAG TTTGAATATT GTAGCGATAA TAATCGGTAT CAGTTTCTTT 

251 GCATGGTCCT GGGAGGGGTT TGGCGCAGGG GCCGTATTCA GG 



33 



10 



(9) INFORMATION FOR SEQ ID N°: 9 

(i) SEQUENCE CHARACTERISTICS : 

(a) > LENGTH : 270 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°9 : 

1 GGTAATAGTT GTCAAATTCC ATTAATGTAT CCTGAAATGT G AC C AT AT C T 

15 51 TTGTTTCCCC TGTAAAATCT CAT AAAAG G C TGTGTGTTTT C C T T AAG AAG 

101 TGTAACAGCC ACGATGGTCA ATCTCACGGA TGGATGTGTG ACACTTTTAT 

151 ATCTCAGGTT TGCCGACATT GCCATTACAG ATAAATAGTT GATAATTTCT 

201 TTCTTGTTAT AGTTGTAAGC AGCGCATGTT GTTGCATCAA G C AC C AC AT G 

251 CACTTCAGGC AATATGGTTT 



20 



25 



50 



(10) INFORMATION FOR SEQ ID N°: 10 



(i) SEQUENCE CHARACTERISTICS : 
30 (a) LENGTH: 316 base pairs 

. (b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

35 (ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°10 : 

1 AGAAAGCAGT CATATTGGCC ATCCACAGGT CACAATGGTT CTCTCCTTGA 

40 

51 CCTGGCATCG GGATTCGAAG TATGGTGCAG TTCACGTAGT TGGAATACAA 
101 CACGAAATGT GTTCGTTGGT ACGCCAATAG GGGTTCTCGC AAAGAACATA 
45 151 TCATTTGGAG GAAGGCGTAG TCCGTCGAGA TATCCCAAAA CTAGGGTTTC 

201 ATTGCGTGCG AACCAACTGC CCCCACTTCT GTATGTGTAC T G T AAG G AG T 
251 RGTTGAACGG YGTCCTCTTT CCCATAACCT TGAAGTTTTC ACACTGCAGA 
301 GGATTACCTC TCAAAA 
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(11) INFORMATION FOR SEQ ID N°: 1 1 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 241 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°l 1 : 

1 AAGGTAGCAA GGGTGGTAGG CTTTCCTCAC AAAGAGTCTG GCTTCCGTGA 

51 TAACCATATC CATTCCTCAC CGTATACCCG TCATCCAACG TCAATTGTGT 

101 T AC AAG G C AG ATAATGTCAA AATGGCTCTG GTCCCTATAA TAGTCGGATA 

151 ATGTAGAAAT CGCTCCATGT GGCCAAATAG ATGTTCCTCT TTCATACTGT 

201 TTTAACTTTA ATTGTAGGTC CGCCTCGTTC TCGAGGTATG T 

(12) INFORMATION FOR SEQ ID N°: 12 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 636 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°12 : 



1 


TTCCCCNAAT 


TGGCCTTGCG 


ANNCTTGCAA 


GTCGACNCTA 


GAGGCTCCGA 


, 51 


AGATGGACAG 


ATTGCGCATG 


AAATATTTGA 


AATCGAGCAG 


AATGGTGATT 


101 


TTAGGAGCGA 


TTATATTGTG 


CCACCCAGTT 


TGAAAGTGCA 


AGAACGCACA 


151 


GTGGTTTACC 


GTAACAAGTA 


CACCAGAGTT 


CCTGTAAATT 


TTACCGTCGA 


201 


AGTTGCCATG 


CTGATTGATA 


AG TAT T TATA 


CWAGGAGTTC 


AAG AAC GAGA 


251 


GCCACATCGT 


ACCGTACCTG 


GCTATGATAC 


TGACTTTGAT 


AAATCTGAGG 


301 


TATGCCGACA 


CACATGACCC 


GTACATCCAG 


TTTCTTCTCA 


CACAAGTGTT 


351 


CGTGGGGAAW 


WCTGGCGATC 


ATATGGGCCA 


CATGCCCTTC 


CGACGAGCGT 


401 


TCTTGTTCAG 


GCGCCGGCAT 


TATGCGCAGT 


TTAGGCCCAA 


TMACACCTTC 


451 


CACTTGTAAT 


TCTCCGTTGT 


TGGATAGTGT 


AAGTGAGGCC 


ATTGCATCAG 
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501 CATCGTGGAA GARGCCTTCC TCCAAGTAGG AACCGCCCAT TTAGGTTTGC 
551 TTTCCCAATC CGCCAATTTA ANTTTTAAAA AAAATTCCCC CCCCAAAAAT 
5 601 TAATTTTTTT TAAAGGTGGA TTGTGATTTC TCCGTT 



(13) INFORMATION FOR SEQ ID N c : 13 
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(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 432 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 
15 (d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°13 : 

20 

1 GATCCCAAAA GTGCCGCTGG ARCGACGGTT ACATCATGAG C T AC G T CAT A 

51 AACTTCAAAA ACCACTTCAA ATTTtCTCCC TGCTGTGTAG AAT C AAT T C G 

25 101 ATTCGTCGCA CGAGAGCGGG A.CTGCCTCTA CAAAGTCAAT GCCAAGGATG 

151 C T G T AAAAAG C C T AAT AT C T CTGCCCGGAT TTAGGATATC GCCAACGAGT 

201 TTCTGTCAAT TTATGCA.TCC GCTTTACCGC GGTGTCCATA GCGATAAGi^ 

30 

251 AGCAGGTCTG TCCGATTGCG TACAGACGTG TAGAACGGCC AAAAATCGAC 

301 GAGGAGGCTA CCATTCATGG ATTCACGCGG CACTTGACGG GGTTCCTTGC 

35 351 GACAAGAGAA ACCCCAAGAA GGCCTGCATA AACGGGAAAT GCACCCTCCT " 

401 TAAGAGCATG CCCCACAGAA CGTACCGGGA AT 



40 



50 



(14) INFORMATION FOR SEQ ID N°: 14 



(i) SEQUENCE CHARACTERISTICS : 
(a) LENGTH : 466 base pairs 
45 (b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 



(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°14 : 

1 AGGGCGTTCT TTGCTT YACA GGGAACRGCA TATGGGCCAC GTGACCTTCC 
55 51 AATGACCGCT CCAAATCTGG CATAGGTTGA AYTCGCAAGT CGTGGCGCAG 
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101 


CAGGCCTYCC 


ACATTCACTC 


CATCCTCGTC 


TTTTAGGATG 


ACTGCCGCCA 


151 






r^oo l vj x x J. \j a 






201 


TAAGTATTGA 


CCAACGATCG 


GCCGAATGAT 


TACGGCTCAC 


CAAACACATC 


251 


AAATACCCCC 


GTCAAGTCAA 


GAGCTGGAAG 


CACAAAGCAT 


AGTATGTACA 


301 


AGATACCCTT 


GGAAATCTTT 


CCCGAAGTTC 


ACCTTGTGGT 


GGACAGCACA 


351 


TTTGCCAAAG 


CTTTTAAATT 


TGACGTGTAC 


AAAGTAACGC 


GTTACTTCGC 


401 


AGTGCTTACA 


AATGCGGCTA 


ATCTTAGGTA 


TGCCAGCTTC 


GTATTTCCAA 


451 


AAGTACAGCT 


C AG GAT 









(15) INFORMATION FOR SEQ ID N°: 15 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 377 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ED N°15 : 

1 CTCGTCCACA CATTCTCCTA AAATGCAAGC CTTTTTTTTC CCACAAGGTG 

51 TACCGTCGAC TACACTGAGT CTCCAATAAA TATGTTTTCC GGTGCAATTT 

101 ACCTTGCAGT CTTTGACGCC GTATGTAGGG TCAGCGTGCA TGCCTTCGTC 

151 GTACATATAC ACCCTCTGAC AGTAGTTGCT GAGTGTTGTC ATCCTACCAG 

201 GAAGCTTAGA CGAACGTTTT ATTGTTTTTG TCGTGTATCG TTCTCTAAGG 

251 CATTTGAATT CCGGACGGTT GTAGAGGTTC CTGACTTCTC ' GCTGGCAGCA 

301 ATAAGAGAAC TGATACTGGC GCTCGTCTTG CATCTTGTAA CTCATGAGGT 

351 ATCCGTCATC CCATGGGCAG TCCGCAG 
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(16) INFORMATION FOR SEQ ID N°: 16 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 1670 base pairs 
5 (b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

10 

(iii) FEATURE : 

(a) NAME/KEY : CDS 

(b) LOCATION : 54 1520 

15 (iv) SEQUENCE DESCRIPTION : SEQ ID N°16 : 

1 AAGGAAGAAG TTAGGCGTAG GCTTTGGGAA ACCGGTCATC CTCGAAACCA GAG 



20 


54 


ATG 
Met 


TCG 
Ser 


GGA 
Gly 


CTC 
Leu 


AGC 
Ser 


.CTG 
Leu 


AAA 
Lys 


TTG 
Leu 


TGG 
Trp 


ATT 
He 


GTA 
Val 


GCG 
Ala 


TTC 
Phe 


TTT 
Phe 


TCT 
Ser 






99 


TTC 
Phe 


TGC 
Cys 


TTG 
Leu 


GCC 
Ala 


GAG 
Glu 


AAA 
Lys 


GAG 
Glu 


CA.T 
His 


GGG 
Gly 


ATC 
He 


GTG 
Val 


TAC 
Tyr 


CCC 
Pro 


AGG 
Arg 


ATG 
Met 


CTT 
Leu 


25 


147 


GAA 
Glu 


AGC 
Ser 


AGA 
Arg 


GCA 
Ala 


GCA 
Ala 


A.CT 
Thr 


GGA 
Gly 


GAG 
Glu 


AGA 
Arg 


ATG 
Met 


CTT 
Leu 


AAA 
Lys 


ATC 
He 


AAC 
Asn 


GAT 
Asp 


GAC 
Asp 


30 


195 


CTG 
Leu 


ACG 
Thr 


TTG 
Leu 


ACG 
Thr 


CTG 
Leu 


CAG 
Gin 


AAG 
Lys 


AGT 
Ser 


AAG 
Lys 


GTC 
Val 


TTC 
Phe 


GCT 
Ala 


GAC 
Asp 


GAC 
Asp 


TTT 
Phe 


CTC 
Leu 


243 


TTC 
Phe 


AGC 
Ser 


ACG 
Thr 


ACC 
Thr 


GAC 
Asp 


GGA 
Gly 


ATT 
He 


GAA 
Glu 


CCT 
Pro 


ATT 
He 


GAT 
Asp 


TAC 
Tyr 


TAC 
Tyr 


ATC 
He 


AAA 
Lys 


GCC 
Ala 


35 


291 


GAA 
Glu 


GAC 
Asp 


GCT 
Ala 


GAA 
Glu 


CGT 
Arg 


GAC 
Asp 


ATC 
He 


TAC 
Tyr 


CAC 
His 


GAC 
Asp. 


GCA 
Ala 


ACT 
Thr 


CAC 
His 


ATG 
Met 


GCA 
Ala 


TCA 
Ser 




339 


GTA 
Val 


AGG 
Arg 


GTA 
Val 


ACG 
Thr 


GAC 
Asp 


GAT 
Asp 


GAT 
Asp 


GGC 
Gly 


GTG 
Val 


GAA 
Glu 


GTG 
Val 


GAA 
Glu 


GGA 
Gly 


ATT 
lie 


CTT 
Leu 


GGA 
Gly 


40 


387 


GAG 
Glu 


AGG 
Arg 


CTT 
Leu 


CGT 
Arg 


GTT 
Val 


AAA 
Lys 


CCT 
Pre 


TTG 
Leu 


CCG 
Pro 


GCA 
Ala 


ATG 
Met 


GCC 
Ala 


CGC 
Arg 


AGC 
Ser 


AGC 
Ser 


GAT 
Asp 


45 


435 


GGC 
Gly 


CTC 
Leu 


AGA 
Arg 


CCG 
Pro 


CAT 
His 


ATG 
Met 


TTG 
Leu 


TAC 
Tyr 


GAA 
Glu 


GTC 
Val 


GAC 
Asp 


GCA 
Ala 


CAC 
His 


GAA 
Glu 


AAC 
Asn 


GGC 
Gly 


483 


CGG 
Arg 


CCA 
Pro 


CAT 

His 


GAT 
Asp 


TAT 
Tyr 


GGT 
Gly 


TCA 
Ser 


CCG 
Pro 


AAC 
Asn 


ACA 
Thr 


ACA 
Thr 


AAT 
Asn 


ACC 
Thr 


CCC 
Pro 


GTA 
Val 


GAG 
Glu 


50 


531 


AGA 
Arg 


AGA 
Arg 


GCT 
Ala 


GGA 
Gly 


GGC 
Gly 


ACA 
Thr 


GAA 
Glu 


CCC 
Pro 


CAG 
Gin 


ATG 
Met 


TAC 
Tyr 


AAG 
Lys 


ATA- 
He 


CCA 
Pro 


GCG 
Ala 


GAA 
Glu 




579 


ATC 
He 


TAT 
Tyr 


CCC 
Pro 


GAA 
Glu 


GTT 
Val 


TAC 
Tyr 


CTT 
Leu 


GTG 
Val 


GCG 
Ala 


GAT 
Asp 


AGT 
Ser 


GCC 
Ala 


TTT 
Phe 


GCC 
Ala 


AAA 
Lys 


GAA 
Glu 


55 


627 


TTT 
Phe 


AAC 
Asn 


TTT 
Phe 


GAT 
Asp 


GTG 
Val 


AAC 
Asn 


GCC 
Ala 


GTT 
Val 


ACG 
Thr 


CGT 
Arg 


TAC 
Tyr 


TTC 
Phe 


GCA 
Ala 


GTG 
Val 


CTT 
Leu 


ACA 
Thr 



• 
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67 5 AAT GCG GCT AAT CTT AGG TAT GAA AGC TTC AAA TCT CCA AAG GTA CAG 
Asn Ala Ala Asn Leu Arg Tyr Glu Ser Phe Lys Ser Pro Lys Val Gin 

72 3 CTC AGG ATC GTT GGC ATA ACG ATG AAC AAA AAC CCA GCA GAC GAG CCA 
Leu Arg lie Val Gly lie Thr Met Asn Lys Asn Pro Ala Asp Glu Pro 

771 TAC ATT CAC AAT ATA CGG GGA TAT GAG CAG TAC CGG AAT ATT TTG TTT 
Tyr lie His Asn lie Arg Gly Tyr Glu Gin Tyr Arg Asn lie Leu Phe 

819 AAG GAA ACA CTG GAG GAT TTC AAC ACT CAG ATG AAG TCA AAA CAT TTT 
Lys Glu Thr Leu Glu Asp Phe Asn Thr Gin Met Lys Ser Lys His Phe 

8 67 TAT CGT ACT GCC GAT ATC GTG TTT CTC GTG ACA GCA AAA AAT ATG TCC 
Tyr Arg Thr Ala Asp lie Val Phe Leu Val Thr Ala Lys Asn Met Ser 

915 GAA. TGG GTT GGT AGC ACA CTA CAA TCA TGG ACT GGC GGG TAC GCT TAC 
Glu Trp Val Gly Ser Thr Leu Gin Ser Trp Thr Gly Gly Tyr Ala Tyr 

20 963 GTA GGA ACA GCG TGT TCC GAA TGG AAA GTA GGA ATG TGT GAA GAC CGA 
Val Gly Thr Ala Cys Ser Glu Trp Lys Val Gly Met Cys Glu Asp Arg 



10 



15 



1011 CCG ACA AGC TAT TAC GGA GCT TAC GTT TTC GCC CAT GAG CTG GCG CAT 
Pro Thr Ser Tyr Tyr Gly Ala Tyr Val Phe Ala His Glu Leu Ala His 

105 9 AAT TTG GGT TGT CAA CAC GAT GGA GAT GGT GCC AAT AGC TGG GTG AAA 
Asn Leu Gly Cys Gin His Asp Gly Asp Gly Ala Asn Ser Trp Val Lys 

1107 GGG CAC ATC GGA TCT GCG GAC TGC CCA TGG GAT GAC GGA TAC CTT * ATG 
Gly His lie Gly Ser Ala Asp Cys Pro Trp Asp Asp Gly Tyr Leu Met 

115 5' AGC TAC AAG ATG GAA GAC GAG CGC CAG TAT AAG TTT TCT CCC TAC TGC 
Ser Tyr Lys Met Glu Asp Glu Arg Gin Tyr Lys Phe Ser Pro Tyr Cys 

35 120 3 CAG AG A GAA GTC AGG AAC CTC TAC AGG CGT CCG GAA TTC AAA TGC CTC 
Gin Arg Glu Val Arg Asn Leu Tyr Arg Arg Pro Glu Phe Lys Cys Leu 



25 



30 



40 



45 



50 



55 



1251 ACT GAA CGA AAA GCG AAA AAA ACA ATC CGC TCG TCT AAG CTA CCT GGT 
Thr Glu Arg Lys Ala Lys Lys Thr He Arg Ser Ser Lys Leu Pro Gly 

12 99 GTG ATG ACA TCA TCG AGC AAC TAT TGC CGG AGG GTG TAC ATG TAC GAA 

Val Met Thr Ser Ser Ser Asn Tyr Cys Arg Arg Val Tyr Met Tyr Glu 

134 7 AAA GGC ATG CAC GCC GAC GAG GCA TaT GGC GTC AAG GAC TGC AGG GTA 
Lys Gly Met His Ala Asp Glu Ala Tyr Gly Val Lys Asp Cys Arg Val 

13 9 5 AAA TGC ACC ACC ACA TCA AGA ATG TAT TGG CTA CTC GGT GTA GTC GAC 

Lys Cys Thr Thr Thr Ser Arg Met Tyr Trp Leu Leu Gly Val Val Asp 

14 4 3 GGT ACA CCT TGC GGA AAT GGA AAG GCT TGC ATT CTT GGG AAA TGC AGG 

Gly Thr Pro Cys Gly Asn Gly Lys Ala Cys He Leu Gly Lys Cys Arg 

14 91 AAC AAA ATC AAA ATA AGC AAG AAG GAC TGA GAGGTTGATA ATATCAAATT 
Asn Lys He Lys He Ser Lys Lys A.sp End 

154 1 AATCATGATA TTTCAACCAC ATGACTTCGT GCTCAACTGG TAGCCCCAAA TAAATTTTAA 
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1601 AAAAAATCCC AATATGCGTG GTAGAAAAAG CAGCAAACAA TAAAACTTCT AAAAATGTCT 
1661 TGCAAAAATG 

5 

(17) INFORMATION FOR SEQ ID N°: 17 

(i) SEQUENCE CHARACTERISTICS : 
10 (a) LENGTH : 158 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

15 (ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°17 : 

1 CACCAGTGAT GCTTATTGTT GCACTGCACT TGTTGATAAT ATCCGGTCGT 

20 

51 CGAATTGCAC TTCGGAACTT CCACTCCAAC TTGGCGAGCC GTGGATTTTG 
101 ACTTCTCGTG ATGCTCCACC AGACAGTTGC AGGACTTCAG CTGCCTAGAT 
25 151 GGAGCCTT 
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(18) INFORMATION FOR SEQ ID N°: 18 



(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 146 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 
35 (d) TOPOLOGY ; linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°18 : 

40 

1 CTGTTGTTGA ACTGAAATAA ATAACAAAAA AATCATAAAG NTGGAGGAAA 
51 GATGATCGAN TCCCCGCCCC TTGACAATCG TCCGATAAAA ACCAACTATA 
45 101 TTCNGTCCTT T T T AC AAAC A ATTCCAANTG' TCTGACCGAA CCGCGA 
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(19) INFORMATION FOR SEQ ID N°: 19 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 140 base pairs 

5 (b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

10 

(iii) SEQUENCE DESCRIPTION : SEQ ID N° 1 9 : 

1 CTNGGACGAN GTCCTATGAC TTGCGCTTAN GTTTCTTAGT CTTCTTCGGT 
15 51 TTCTTCTTTT TTTGCTTCGG TTTTTCGGTG GGCGCAGGTG TATAGTCATC 

101 AGTGTCGGTG GGCCCATCCG AATGAGTTGT CAAATGACAT 



20 

(20) INFORMATION FOR SEQ ID N°: 20 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 143 base pairs 

25 (b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

30 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°20 : 

1 TGCCGAAAAA TAACGATGAT TTGACGTTGA CTCTGCAGAA GAGTAAGGTT 
35 51 TTCACCGACA GTTTTCTGTT TAGCACGACG AAGGATAACG AGCCTATCGA 

101 TTACTACGTG AGAGCCGAAG ATGCCGAACG AGACATATAT CAC 



40 

(2 1 ) INFORMATION FOR SEQ ID N°: 2 1 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 140 base pairs 

45 (b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

50 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°21 : 

1 TGTTGCTACA GACTCGACGT TTCGAGCTTG CTCGCCATTT MAAGACAACG 
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51 CACTCACAGA ATATTTAAGT GCGTTCGTGA WAGCTGTGGG CTTACGATTG 
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101 CAGGCGCTTC ANTCACCAGC TGTGATATTA MAGTTCCTAG 



5 (22) INFORMATION FOR SEQ ID N°: 22 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 144 base pairs 

(b) TYPE : nucleic acid 

10 (c) STRAND EDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

15 (iii) SEQUENCE DESCRIPTION : SEQ ID N°22 : 

1 TCACGATAGT TGAAACGTTG AAACTTGAAA TACTCCCACA GTCGTTGGAT 
51 GCTTCAGAAC TGCTAAGAAC TTCACACTTT GCAAGAAGTW CCAAAATGAA 

20 

101 AGCCGCGATG ACCGATGATT TAGCTTCCAT CTTCTATCAC TTGA 



25 (23) INFORMATION FOR SEQ ID N°: 23 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 95 base pairs 

(b) TYPE : nucleic acid 

30 (c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

35 (iii) SEQUENCE DESCRIPTION : SEQ ID N°23 : 

1 GACCACCCCG TCCGAACTTG CTAAAKCAAG CAATGGAGTG AGGTGTTCTA 
51 TGCGGGTTGA TTACACCAAT GGCGCTGCGT GGTGCGTGGT GATTT 

40 



• 
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(24) INFORMATION FOR SEQ ID N 3 : 24 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 1414 base pairs 

5 (b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

10 

(iii) FEATURE : 

(a) NAME/KEY : CDS 

(b) LOCATION : 143 .. 1276 

15 (iv) SEQUENCE DESCRIPTION : SEQ ID N°24 : 

1 GTAGGGCCGT GCAAGCGAAG GCAGCGAAGG CTGCGAGTGT ACGTGCAGTT CGGAAGTGCA 

61 ATATCCTGTT A.TTAAGCTCT AAT TAG C AC A CTGTGAGTCG ATCAGAGGCC TCTCTTA^CG 

20 

121 CCACATTGAA AAAGGATCCA AG ATG GAG GCA AGT CTG AGC AaC CAC ATC CTT 

Met Glu Ala Ser Leu Ser Asn Kis lie Leu 



25 


173 


AAC 
Asn 


TTC 
Phe 


TCC 
Ser 


GTC 
Val 


GAC 
A.sp 


CTA 
Leu 


TAC 
Tyr 


AAG 
Lys 


CAG 
Gin 


CTG 
Leu 


AAA 
Lys 


CCC 
Pro 


TCC 
Ser 


GGC 
Gly 


AAA 
Lys 


GAC 
Asp 




221 


ACG 
Thr 


GCA 
Ala 


GGA 
Gly 


AAC 
Asn 


GTC 
Val 


TTC 
Phe 


TGC 
Cys 


TCA 
Ser 


CCA 
Pro 


TTC 
Phe 


AGT 
Ser 


ATT 
He 


GCA 
Ala 


GCT 
Ala 


GCT 
Ala 


CTG 
Leu 


30 


269 


TCC 
Ser 


ATG 
Met 


GCC 
Ala 


CTC 
Leu 


GCA 
Ala 


GGA 
Gly 


GCT 
Ala 


AGA 
A.rg 


GGC 
Gly 


AAC 
Asn 


ACT 
Thr 


GCC 
Ala 


AAG 
Lys 


CAA 
Gin 


ATC 
He 


GCT 
Ala 


35 


317 


GCC 
Ala 


ATC 
lie 


CTG 
Leu 


CAC 
His 


TCA 
Ser 


AAC 
Asn 


GAC 
Asp 


GAC 
Asp 


AAG 
Lys 


ATC 
He 


CAC 
His 


GAC 
Asp 


CAC 
His 


TTC 
Phe 


TCC 
Ser 


AAC 
Asn 


365 


TTC 
Phe 


CTT 
Leu 


TGC 
Cys 


AAG 
Lys 


CTT 
Leu 


CCC 
Pro 


AGT 
Ser 


TAC 
Tyr 


GCC 
Ala 


CCA 
Pro 


GAT 
Asp 


GTG 
Val 


GCC 
Ala 


CTG 
Leu 


CAC 
His 


ATC 
He 


40 


413 


GCC 
Ala 


AAT 
Asn 


CGC 
Arg 


ATG 
Met 


TAC 
Tyr 


TCT 
Ser 


GAG 
Glu 


CAG 
Gin 


ACC 
Thr 


TTC 
Phe 


CAT 
His 


CCG 
Pro 


AAA 
Lys 


GCG 
Ala 


GAG 
Glu 


TAC 
Tyr 




461 


ACA 
Thr 


ACC 
Thr 


CTG 
Leu 


TTG 
Leu 


CAA 
Gin 


AAG 
Lys 


TCC 
Ser 


TAC 
Tyr 


GAC 
Asp 


AGC 
Ser 


ACC 
Thr 


ATC 
He 


AAG 
Lys 


GCT 
Ala 


GTT 
Val 


GAC 
Asp 


45 


509 


TTT 
Phe 


GCA 
Ala 


GGA 
Gly 


AAT 
Asn 


GCC 
Ala 


GAC 
Asp 


AGG 
Arg 


GTC 
Val 


CGT 
Arg 


CTG 
Leu 


GAG 
Glu 


GTC 
Val 


AAT 
Asn 


GCC 
Ala 


TGG 
Trp 


GTT 
Val 


50 


557 


GAG 
Glu 


GAA 
Glu 


GTC 
Val 


ACC 
Thr 


AGG 
Arg 


TCA 
Ser 


AAG 
Lys 


ATC 
He 


AGG 
Arg 


GAC 
Asp 


CTG 
Leu 


CTC 
Leu 


GCA 
Ala 


CCT 
Pro 


GGA 
Gly 


ACT 
Thr 


605 


GTT 
Val 


GAT 
Asp 


TCA 
Ser 


TCG 
Ser 


ACA 
Thr 


TCA 
Ser 


CTT 
Leu 


ATA 
He 


TTA 
Leu 


GTG 
Val 


AAT 
Asn 


GCC 
Ala 


ATT 
He 


TAC 
Tyr 


TTC 
Phe 


AAA 
Lys 


55 


653 


GGT 
Gly 


CTG 
Leu 


TGG 
Trp 


GAT 
Asp 


TCT 
Ser 


CAG 
Gin 


TTC 
Phe 


AAG 
Lys 


CCT 
Pro 


AGT 
Ser 


GCT 
Ala 


ACG 
Thr 


AAG 
Lys 


CCG 
Pro 


GGA GAT 
Gly Asp 
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701 TTT CAC TTG ACA CCA CAG ACC TCA AAG AAA GTG GAC ATG ATG CAC CAG 

Phe His Leu Thr Pro Gin Thr Ser Lys Lys Val Asp Met Met His Gin 

74 9 GAA GGG GAC TTC AAG ATG GGT CAC TGC AGC GAC CTC AAG GTC ACT GCG 

5 Glu Gly Asp Phe Lys Met Gly His Cys Ser Asp Leu Lys Val Thr Ala 

7 97 CTT GAG ATA CCC TAC AAA GGC AAC AAG ACG TCG ATG GTC ATT CTC CTG 

Leu Glu lie Pro Tyr Lys Gly Asn Lys Thr Ser Met Val lie Leu Leu 

10 84 5 CCC GAA GAT GTA GAG GGA CTC TCA GTC CTG GAG GAA CAC TTG ACC GCT 

Pro Glu Asp Val Glu Gly Leu Ser Val Leu Glu Glu His Leu Thr Ala 



8 93 CCG AAA CTG TCG GCT CTG CTC GGC GGC ATG TAT GCG ACG TCC GAT GTC 

Pro Lys Leu Ser Ala Leu Leu Gly Gly Met Tyr Ala Thr Ser Asp Val 

15 

94 1 AAC TTG CGC TTG CCG AAG TTC AAA CTA GAG CAG TCC ATA GGT TTG AAG 

Asn Leu Arg Leu Pro Lys Phe Lys Leu Glu Gin Ser lie Gly Leu Lys 



98 9 GAT GTA CTG ATG GCG ATG GGA GTC AAG GAT TTC TTC ACG TCC CTT GCA 

20 Asp Val Leu Met Ala Met Gly Val Lys Asp Phe Phe Thr Ser Leu Ala 

10 57 GAT CTT TCT GGC ATC AGC GCT GCG GGG AAT CTG TGC GCT TCG GAT GTC 

Asp Leu Ser Gly lie Ser Ala Ala Gly Asn Leu Cys Ala Ser Asp Val 

25 103 5 ATC CAC AAG GCT TTT GTG GAA GTT AAT GAG GAG GGC ACA GAG GCT GCA 

lie His Lys Ala Phe Val Glu Val Asn Glu Glu Gly Thr Glu Ala Ala 



1133 GCT GCC ACT GCC ATA CCC ATT ATG TTG ATG TGT GCG AGA TTT CCA CAG 

Ala Ala Thr Ala lie Pro lie Met Leu Met Cys Ala Arg Phe Pro Gin 

30 

1181 GTG GTG AAC TTT TTC GTT GAC CGC CCA TTC ATG TTC TTG ATC CAC AGC 

Val Val Asn Phe Phe Val Asp Arg Pro Phe Met Phe Leu lie His Ser 



122 9 CAT GAT CCA GAT GTT GTT CTC TTC ATG GGA TCC ATC CGT GAG CTC TAA 
35 His Asp Pro Asp Val Val Leu Phe Met Gly Ser lie Arg Glu Leu End 

127 7 AAAGCATATT CTTAACGGCG GCCAATCAGT CTGTGGAGTT ATCTCTTAGT CACTAATGTG 



13 37 TAACAATTCT GCAATATTCA GCTTGTGTAT TTCAGTAACT TGCTAGATCT TTGTGTTGTT 

40 

1397 GATGTTAGGC TTCTTGCG 



44 



10 



(25) INFORMATION FOR SEQ ID N°: 25 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 200 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°25 : 

1 ACCGTAACCA AAATTGTTTC TTTCCAGAAG AATGGTTCAA AC T T T T C AAA 

15 51 CAGATTTCGG AAACTCTTCT TGCACTTTTA AAAT CC AAT C TACAATCTTT 

101 CCTCGCACTT CTGAATTGCA TTCCAGTTTA CCTTCCAAGC AAACCTCTTT 

151 TGGCAACTCC AGCCGTACTC CATTTCGGCA TACCACAGTG CATGCACTTG 

20 

(26) INFORMATION FOR SEQ ID N°: 26 

25 (i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 241 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

30 

(ii) MOLECULE TYPE : cDNA 

(iii) SEQUENCE DESCRIPTION : SEQ ID N°26 : 

35 1 CGTATTCTTT GAAGATTTGT ATACGAAACA TAAATTCGTC ATGCATACTT 

51 TTGATGGTTA CACGACATGC GPJ^GCTGCCG ACAAAGAAGA CTGGGAAGAT 
101 AAGAAGCACC TAGTTACGGT AGTGCGTGGA CCGGATAAAC GAAAGTACAC 

40 

151 GTTTCTACGC AACATTCTCA CCTTACAACG GAGAGTGAGA GTTAGCAAAA 
201 CAATGATTGA GCTCGTACGG AACATGTCCT GTAGGACATT T 

45 
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(27) INFORMATION FOR SEQ ID N°: 27 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 313 base pairs 

5 (b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

10 

(iii) SEQUENCE DESCRIPTION : SEQ ID N c 27 : 





1 


AAGCANCCGG 


ACTACCTGCT 


TGAAAACGTT 


GTACGGGCAA 


ACTTGGACGG 


15 * 


5 1 


AAAACTCCCA 


GATGCTACTC 


CAGTTCCTCC 


CGGAAGCTAC 


ACGTACGCTG 




101 


AGAATGATAA 


CTTCACCTGC 


TATTCCAGAA 


GTACACCGTT 


TCCGGATGGG 


20 


151 


GTGAATGTTG 


TATAACGGCT 


GCTGGGTGCG 


GAAGACTATG 


AT G GAT T AC G 


201 


CAAAAAAGTT 


CTAAACGAGT 


TGTTTCCCAT 


CCCGGAAAGT 


CTGCTGTATG 




251 


CTGACATGAT 


GCGACTTGTG 


GCTAAGAAAG 


ACAGAGTTGA 


T C AC AC T AG T 


25 


301 


GGATGACCTG 


GGA 









(28) INFORMATION FOR SEQ ID N°: 28 

30 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH: 2417 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS :. single 
35 (d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) FEATURE : 

40 (a) NAME/KEY : CDS 

(b) LOCATION : 218 1495 

(iv) SEQUENCE DESCRIPTION : SEQ ID N°28 : 

45 1 GTCGTAGTCG TAGTCGTAGT CAGTTGCGCA TGCGCGGGGC TTTCCTGTCT TTCTTGCCTT 

61 TCTGCAGTCG TTCACCAACA TGTGGATACA GCTCCGGAGA TTTGTAAACA AATACTGCAC 
121 TTTTAAGCAA GACTTGATAT T T AG AT CG AT ATCCTCCTGT TGTCCGTCTT GATTAATCGG 

50 

181 CTCTTTAGGG TTTTTAGAAT AGGCTTTTCG GTACGAG ATG CCC AAA GGA AAG AGG 

Met Pro Lys Gly Lys Arg 

236 GGA CCC AAA GCA GGT GGC GCC GCG CGC GGT GGC CGG TGC GAG GCC AGC 
55 Gly Pro Lys Ala Gly Gly Ala Ala Arg Gly Gly Arg Cys Glu Ala Ser 
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284 CTG 
Leu 

332 GTG 
5 Val 

38 0 GGA 
Gly 

10 428 AAA 
Lys 

476 CGG 
Arg 

15 

52 4 CTG 
Leu 

57 2 GAG 
20 Glu 

620 ACT 
Thr 

25 6 68 GAC 
A.sp 

716 GAC 
Asp 

30 

7 64 GGC 

Gly 

812 CCG 
35 Pro 

8 60 GGC 

Gly 

40 908 CTC 
Leu 

956 CTG 
Leu 

45 

1004 ATT 
He 

1052 TAC 
50 Tyr 

1100 CGC 
Arg 

55 1148 CGC 
Arg 



GCT CCG TCg 
Ala Pro Ser 

CTG AGC TGC 
Leu Ser Cys 

GAC CCA GAA 
Asp Pro Glu 

CTC AAG GAG 
Leu Lys Glu 

CTG TCG TGC 
Leu Ser Cys 

TAC GAC TTC 
Tyr Asp Phe 

CGT GGG GTG 
Arg Gly Val 

CTC GGG GCC 
Leu Gly Ala 

GCC CTG TTC 
Ala Leu Phe 

GGG GCC CAG 
Gly Ala Gin 

CTC TGC TGc 
Leu Cys Cys 

TGC ATG GAG 
Cys Met Glu 

TCT CCG GTG 
Ser Pro Val 

AGC GTG GCT 
Ser Val Ala 

CCC CGG CTG 
Pro Arg Leu 

GCG GCC GGG 
Ala Ala Gly 

GAC GAG GAC 
Asp Glu Asp 

CAG CTG GCC 
Gin Leu Ala 

CAG CAG CGC 
Gin Gin Arg 



TCC AGC GAC 
Ser Ser Asp 

GCC TCG GAG 
Ala Ser Glu 

GCG GAG 'GAG 
Ala Glu Glu 

GCC ATC GAC 
Ala He Asp 

CTG GAG GCG 
Leu Glu Ala 

CTC ATG GAC 
Leu Met Asp 

CGC AAG GGC 
Arg Lys Gly 

CTG GCC TGC 
Leu Ala Cys 

GAC GCC CTG 
Asp Ala Leu 

GGG CCC TCC 
Gly Pro Ser 

TTC GTG GTG 
Phe Val Val 

GTG CTC TGG 
Val Leu Trp 

CTC CAG GCA 
Leu Gin Ala 

CCC GTC GAC 
Pro Val Asp 

CAG GAG CTG 
Gin Glu Leu 

GAG GTG ATC 
Glu Val He 

TTT GAG GAG 
Phe Glu Glu 

ACG GAC AGC 
Thr Asp Ser 

TCC ACC TTC 
Ser Thr Phe 



GAG GAG TCC 
Glu Glu Ser 

TCT CGC TGT 
Ser Arg Cys 

GCT GTG CTG 
Ala Val Leu 

GGA GCT TCG 
Gly Ala Ser 

ATT CGC AAG 
He A.rg Lys 

AGA CCG AGC 
Arg Pro Ser 

CGA GGG GAG 
Arg Gly Glu 

GTC CAG CTC 
Val Gin Leu 

CGC CAG CCG 
Arg Gin Pro 

CCC AGG GCC 
Pro Arg Ala 

GAC TCG GAC 
Asp Ser Asp 

CAG GTG GTG 
Gin Val Val 

GCG GCC CTG 
Ala Ala Leu 

CGc CTG CTG 
Arg Leu Leu 

CTG GAG AGC 
Leu Glu Ser 

GCC GTC ATG 
Ala Val Met 

CCC TCG GAG 
Pro Ser Glu 

CAG AAG TTT 
Gin Lys Phe 

AGG GAC GTC 
Arg Asp Val 



AAC GCA GAC 
Asn Ala Asp 

GGC AGT GAC 
Gly Ser Asp 

CAT GAC GAC 
His Asp Asp 

CAG AAG AGT 
Gin Lys Ser 

GCC TTT TcC 
Ala Phe Ser 

ACG GTG TGC 
Thr Val Cys 

GAG GCG GCC 
Glu Ala Ala 

GGG GTC GGG 
Gly Val Gly 

CTC TGC ACT 
Leu Cys Thr 

AGG TGT GCC 
Arg Cys Ala 

AAC CAG CTG 
Asn Gin Leu 

GGT GCC AAG 
Gly Ala Lys 

CTC GCC TGG 
Leu Ala Trp 

GCG CTc ACG 
Ala Leu Thr 

CCC GAC CTG 
Pro Asp Leu 

TAC GAG GGG 
Tyr Glu Gly 

TCC CTG TGT 
Ser Leu Cys 

CGG GCC AAG 
Arg Ala Lys 

TAC CGG GCC 
Tyr Arg Ala 



ACG GCG AGC 
Thr Ala Ser 

GGC ACC GTT 
Gly Thr Val 

TTT GAA GAC 
Phe Glu Asp 

GCC AAA GGA 
Ala Lys Gly 

ACC AAA TAC 
Thr Lys Tyr 

GAC CTG GTG 
Asp Leu Val 

CTG TGC GCC 
Leu Cys A.la 

GCC GAG GCG 
Ala Glu Ala 

TTG CTG CTT 
Leu Leu Leu 

ACT GCC CTC 
Thr A.la Leu 

GTG CTG CAG 
Val Leu Gin 

GCG GGC CCC 
Ala Gly Pro 

GGC CTC CTG 
Gly Leu Leu 

CGC ACG CAC 
Arg Thr His 

GAC CTG CGC 
Asp Leu Arg 

GCC AGG GAC 
Ala Arg Asp 

GCC CAG CTG 
A.la Gin Leu 

AAG GAG CGG 
Lys Glu Arg 

GTC AGG GAG 
Val Arg Glu 
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1196 GGG GCC TCT CCC GAC GTG AGC GTC AAG TTT GGC CGG GAA GTC CTG GAA 

Gly Ala Ser Pro Asp Val Ser Val Lys Phe Gly Arg Glu Val Leu Glu 

12 4 4 CTG GAC ACC TGG AGT CGC AAG CTG CAG TAC GAC GCT TTC TGC CAG CTG 

5 Leu Asp Thr Trp Ser Arg Lys Leu Gin Tyr Asp Ala Phe" Cys Gin Leu 

12 92 CTG GGC TCC GGC ATG AAC CTG CAC CTG GCC GTG AAC GAG CTG CTG AGG 

Leu Gly Ser Gly Met Asn Leu His Leu Ala Val Asn Glu Leu Leu A.rg 

10 134 0 GAC ATC TTT GAA CTG GGG CAG GTG CTG GCA ACC GAG GAC CAC ATT ATC 

Asp lie Phe Glu Leu Gly Gin Val Leu Ala Thr Glu Asp His lie lie 

133 8 TCC AAG ATC ACC AAG TTC GAA AGG CAC ATG GTG AAC ATG GCC AGC TGC 

Ser Lys lie Thr Lys Phe Glu Arg His Met Val Asn Met Ala Ser Cys 



15 



14 36 CGG GCC CGC ACC AAG ACA CGC AAC CGG CTG AGG GAC AAG CGC GCC GAC 
Arg Ala Arg Thr Lys Thr Arg Asn Arg Leu Arg Asp Lys Arg Ala Asp 



14 84 GTG GTC GCC TGA ACCTGCGGAG GGATGCTTAG CTATGCACTC GCCGGCCTAC 
20 Val Val Ala End 

153 6 CCTGGCGGGA CTCGATGCCA CTCACGAGTC GGCGCTCGCA AATTCGCCGC CCATCGTTAC 

15 96 GCAATGGGAG ACAAAGCTGC TTTTGGCATT ACC G TTT GAG GTCGGCTCCA ACCCATAGAT 

25 

165 6 GAATTTCTTT TTTGTGGCCG TTTCTGGGTT ACATGTTTTG GGGGAAGGGA GTGGAACTGT 

1716 CCGGTTCTTT GGCACACGTC AGGTTGCTCT TGATGCGCGA CGTGCTTGTA TTTGGGtACT 

30 177 6 GCCGACACCA AGCGTTTCGG C GAT TCC TGG AAAAGAGTGC CTCTCGCTCG ACGTTTGGTT 

1836 GTTTTCTGCG TGGTCCGTCG TCGACCTTCG TTCGTCCAAA GACGCCGTCC GGTTTCAtAC 

18 96 TCCCCCCCGC ACA.CATATCG AGGCCAATTA AATTGCTAAG GGTGCCGTTG TCGTGCATCT 

35 

195 6 GGCAGGCTCA GAAGTGGCTT ATTTGTCTTT TAATTTTGCC GATGCACGCA AAAAT T G T C A 

2016 TTTCTTGAAA GTTTCTCTTT TATTGCGTAC ACAATTCAAC TTTTATGTAA TTTCTGATGG 

40 2 07 6 TCTGTTTTAC GTGTGCGTGT GTAAAACGTA AC TT TGG AAG AATTTTTATG CACACTGAAC 

2136 AAACGCTC GG TCCTGGGGTT GAAAGTGCTC GGTGTGTGCA TGAGCTAAAG TGCAACTGCT 

2196 TTGTTCCGAA GGTTTTCTAG TCGCCGAAAT GTACCATTGT GGACCTTGTT GCGAGAGACC 

45 

2256 TTGGTCTTCT GGGGGAGCTG CTGTAGCGTG GCAAGCCACT ATTTTGGGAG CGACATTGCA 

2316 GAG AAAAT CG GCTTTTAGAA AGGCACCTGC GCGGCGAGTG " GACGTTTTTT CG TAT AT ACT 
50 2 37 6 G C G AAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 
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(29) INFORMATION FOR SEQ ID N°: 29 

(i) SEQUENCE CHARACTERISTICS : 

(a) LENGTH : 933 base pairs 

(b) TYPE : nucleic acid 

(c) STRANDEDNESS : single 

(d) TOPOLOGY : linear 

(ii) MOLECULE TYPE : cDNA 

(iii) FEATURE : 

(a) NAME/KEY : CDS 

(b) LOCATION : 32 S53 

15 (iv) SEQUENCE DESCRIPTION : SEQ ID N°29 : 

1 GATTGGGAAC CTCCTATTCC TCACTTGAAA C ATG GCT GGA CTC CGC TCC 

Met A.la Gly Leu Arg Ser 

20 



25 



30 



35 



40 



50 


TGC 


ATC 


CTC 


CTG 


GCT 


CTT 


GCC 


ACT 


AGT 


GCC 


TTC 


GCC 


GGC 


TAC 


CTT 


CAC 




Cys 


lie 


Leu 


Leu 


Ala 


Leu 


Ala 


Thr 


Ser 


Ala 


Phe 


Ala 


Gly 


Tyr 


Leu 


His 


98 


GGT 


GGC 


CTT 


ACC 


CAC 


GGC 


GCT 


GGG 


TAC 


GGT 


TAC 


GGT 


GTC 


GGC 


TAC 


GGT 




Gly 


Gly 


Leu 


Thr 


His 


Gly 


Ala 


Gly 


Tyr 


Gly 


Tyr 


Gly 


Val 


Gly 


Tyr 


Gly 


146 


TCC 


GGC 


CTT 


GGC 


TAT 


GGC 


CTT 


GGC 


TAC 


GGT 


TCC 


GGC 


CTT 


GGC 


TAT 


GGA 




Ser 


Gly 


Leu 


Gly 


Tyr 


Gly 


Leu 


Gly 


Tyr 


Gly 


Ser 


Gly 


Leu 


Gly 


Tyr 


Gly 


194 


CAT 


GCT 


GTT 


GGC 


CTT 


GGA 


CAC 


GGC 


TTT 


GGC 


TAT 


TCT 


GGT 


CTG 


ACC 


GGC 




His 


Ala 


Val 


Gly 


Leu 


Gly 


His 


Gly 


?he 


Gly 


Tyr 


Ser 


Gly 


Leu 


Thr 


Gly 


242 


TAG 


AGT 


GTG 


GCT 


GCC 


CCA 


GCT 


AGC 


TAC 


GCC 


GTT 


GCT 


GCT 


CCA 


GCC 


GTC 




Tyr 


Ser 


Val 


Ala 


Ala 


Pre 


Ala 


Ser 


Tyr 


Ala 


Val 


Ala 


Ala 


Pro 


Ala 


Val 


290 


AGC 


CGC 


ACC 


GTT 


TCC 


ACT 


TAC 


CAC 


GCT 


GCT 


CCA 


GCT 


GTG 


GCC 


ACC 


TAC 




Ser 


Arg 


Thr 


Val 


Ser 


Thr 


Tyr 


His 


Ala 


Ala 


Pro 


Ala 


Val 


Ala 


Thr 


Tyr 


338 


GCC 


GCT 


GCT 


CCT 


GTC 


GCC 


ACC 


TAT 


GCT 


GTT 


GCT 


CCA. 


GCT 


GTC 


ACT 


AGG 




Ala 


Ala 


Ala 


Pro 


Val 


Ala 


Thr 


Tyr 


Ala 


Val 


Ala 


Pro 


Ala 


Val 


Thr 


Arc 


386 


GTT 


TCC 


CCC 


GTT 


CGC 


GCC 


GCC 


CCA 


GCT 


GTG 


GCC 


ACG 


TAC 


GCC 


GCC 


GCT 




Val 


Ser 


Pro 


Val 


Arg 


Ala 


Ala 


Pro 


Ala 


Val 


Ala 


Thr 


Tyr 


Ala 


Ala 


Ala 



434 CCA 
45 Pro 

4 82 ATT 
He 

50 530 ACC 
Thr 

578 GCT 
Ala 

55 

62 6 ACT 



GTC GCC ACC 
Val Ala Thr 

CAC GCT GCC 
His Ala Ala 

TAT GCC GCT 
Tyr Ala Ala 

CCA GCC GTG 
Pro Ala Val 

GTG GCT CAT 



TAC GCC GCT 
Tyr Ala A.la 

CCG GCT GTG 
Pro Ala Val 

GCT CCA GCT 
Ala Pro Ala 

GCT AGC TAC 
Ala Ser Tyr 

GCT CCA GCT 



GCT CCA GCT 
Ala Pro Ala 

GCC AAT TAC 
Ala Asn Tyr 

GTG ACC AGG 
Val Thr Arg 

CAG ACC TAC 
Gin Thr Tyr 

GTG GCC AGC 



GTG ACC AGG 
Val Thr Arg 

GCC GTC GCT 
Ala Val Ala 

GTG TCC ACC 
Val Ser Thr 

CAC GCT CCA 
His Ala Pro 

TAC CAG ACC 



GTG TCC ACC 
Val Ser Thr 

CCA GTC GCC 
Pro Val Ala 

ATC CAC GCC 
He His Ala 

GCT GTC GCC 
Ala Val Ala 

TAC CAC GCT 
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Thr 


Val 


Ala 


His 


Ala 


Pro 


Ala 


Val 


Ala 


Ser 


Tyr 


Gin 


Thr 


Tyr 


His 


Ala 


674 


GCC 


CCA 


GCC 


GTG 


GCT 


ACC 


TAC 


GCC 


CAT 


GCC 


GCT 


CCC 


GTC 


TAC 


GGC 


1 TAT 




Ala 


Pro 


Ala 


Val 


Ala 


Thr 


Tyr 


Ala 


His 


Ala 


Ala 


Pro 


Val 


Tyr 


Gly 


Tyr 


722 


GGT 


GTC 


GGT 


ACC 


CTC 


GGA 


TAT 


GGT 


GTC 


GGC 


CAC 


TAC 


GGC 


TAC 


GGA 


CAC 




Gly 


Val 


Gly 


Thr 


Leu 


Gly 


Tyr 


Gly 


Val 


Gly 


His 


Tyr 


Gly 


Tyr 


Gly 


His 


770 


GGT 


CTT 


GGC 


AGC 


TAC 


GGC 


CTG 


AAC 


TAC 


GGT 


TAC 


GGC 


CTC 


GGC 


ACC 


TAC 




Gly 


Leu 


Gly 


Ser 


Tyr 


Gly 


Leu 


Asn 


Tyr 


Gly 


Tyr 


Gly 


Leu 


Gly 


Thr 


Tyr 


318 


GGT 


GAC 


TAC 


ACC 


ACC 


CTT 


CTC 


CGC 


AAG 


AAG 


AAG 


TAA 


ATGGCA CATCTCAAGA 




Gly 


Asp 


Tyr 


Thr 


Thr 


Leu 


Leu 


Arg 


Lys 


Lys 


Lys 


End 











15 8 70 GAGCCCATTG GACTGCCATC GACATTCTTC TTCAATAAAA GAGCCCGAAG ATGGCATTAT 
S30 TTTT 
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CLAIMS 

1. A method for characterising genes which are induced in 
the salivary glands of a tick during the tick feeding phase which 
comprises : 

a) selectively cloning mRNAs induced during the tick 
feeding phase to obtain a corresponding cDNA library ; 

b) cloning full-length cDNAs corresponding to som - e *f k**f 
incomplete cDNA sequence^identified in the library obtained in step 
a). 

2. The method of claim 1, wherein said tick is a Ixodes 

ricinus tick. 

3. The method of claim 2, wherein the genes induced are 
induced during the slow-feeding phase of the blood meal. 

4. The method of any of claims 1-3, which comprises : 

a) synthesising uninduced cDNAs starting from mRNAs 
expressed in the salivary gland of unfed ticks ; 

b) synthesising induced cDNAs starting from mRNAs 
expressed in the salivary gland of fed ticks ; 

c) subtracting said uninduced cDNAs from said induced 

cDNAs ; 

d) isolating and cloning specifically induced cDNAs, thus 
obtaining a sub^tractive library ; 



e) obtaining corresponding full-length induced cDNA ; 
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f) sequencing and comparing said full-length induced 
DNA molecules with known-polypept ide and polynucleotide 
sequences. 

5 5. The method of claim 4, wherein a full-length cDNA 

library is set up and screened by means of incomplete cDNAs isolated 
from the sub\tractive library. 
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6. The method of claim 4, which comprises : 

a) randomly sequencing a number of clones of said 
sub^tractive library ; 

b) comparing their DNA and amino acid translated 
15 sequences with DNA and protein databases ; 

c) identifying distinct family sequences ; 

d) characterising their corresponding full-length mRNA 

20 sequence. 

7. A polynucleotide isolated^from tick salivary glands, ^ 
which encodes a tick salivary gland polypeptide^and fragments 
thereof, and any closely related or complementary polynucleotide. 

25 & mbfc^U 

8. The polynucleotide of claim 7, whenever obtained^from 
Ixodes ricinus salivary glands. 

9. The polynucleotide of claim 7, which is complementary 
30 to a Ixodes ricinus salivary gland cDNA. 

10. Tke polynucleotide^^ claims iS-a^wl* 9 having a 
nucleotide sequence selected from the group consisting of seq. id. no. 
1, seq. id. no. 2, seq. id. no, 3, seq. id. no. 4, seq. id. no. 5, seq. id. 

35 no. 6, seq. id. no. 7, seq. id. no. 8, seq. id. no. 9, seq. id. no. 10, seq. 
id. no. 11, seq. id. no. 12, seq. id. no. 13, seq. id. no. 14, seq. id. no. 
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15, seq. id. no. 16, seq. id. no. 17, seq. id. no. 18, seq. id. no. 19, seq. 
id. no. 20, seq. id. no. 21, seq. id. no. 22, seq. id. no. 23, seq. id. no. 
24, seq. id. no. 25, seq. id. no. 26, seq. id. no. 27, seq. id. no. 28, seq. 
id. no. 29 or a sequence complementary thereto^ o<- * (U^^~f- t^c^f . 

1 1. The polynucleotide of any of claims 7-10, further 
including a polynucleotide having over its entire length at least 75% 
identity to a nucleotide sequence encoding the Ixodes ricinus salivary 
gland polypeptide encoded by a polynucleotide as defined in claim 10. 

12. The polynucleotide of any of claims 7-10, further 
including a polynucleotide comprising a nucleotide sequence that is at 
least 75% identical with a nucleotide sequence as defined in claim 10. 

13. The polynucleotide of any of claims 7-10 further 
including a nucleotide sequence which has sufficient identity to a 
nucleotide sequence as defined in claim 10 to hybridize under 
conditions useable for amplification or for use as a probe or marker. 

14. An isolated polypeptide encoded by the 
polynucleotide of claims 7-13. 

15. An immunological composition or vaccine for 
inducing an immunological response in a mammalian host to a tick 
salivary gland polypeptide which comprises at least one member of the 
group consisting of 

a) a tick salivary gland cDNA as defined in any of claims 

7-14 ; 

b) a tick salivary gland polypeptide as defined in 

claim 14 ; 

c) epitope-bearing fragments, analogs, outer-membrane 
vesicles or cells (attenuated or otherwise) of components (a) or (b) ; 
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d) possibly a carrier. 

16. A therapeutics agent having anticoagulant properties 
containing at least one polypeptide encoded by a polynucleotide 

5 having a nucleotide sequence selected from the group consisting of 
seq. id. no. 7, seq. id. no. 16, seq. id. no. 24 } i^y^^h t^-^of • 

17. A therapeutics agent having immunomodulatory 
properties containing at least one polypeptide encoded by a 

10 polynucleotide having a nucleotide sequence selected from the group 
consisting of seq. id. no. 11, seq. id. no. 17, seq. id. no. 19, seq. id. 
no. 28 y a*d seq. id. no. 29, 

18. A therapeutics agent as claimed in any of claims 16 
15 and 17 for use alone or in combination with an anti-tick vaccine, 

among others as defined in claim 15, to prevent the transmission of 
pathogens carried by the ticks. 

19. A polynucleotide which is identical or sufficiently 
20 identical to a nucleotide sequence selected from the group consisting 

of seq. id. no. 1, seq. id. no. 2, seq. id. no. 3, seq. id. no ; 4, seq. id. 
no. 5, seq. id. no. 6, seq. id. no. 7, seq. id. no. 8, seq. id. no. 9, seq. 
id. no. 10, seq. id. no. 11, seq. id. no. 12, seq. id. no. 13, seq. id. no. 
14, seq. id. no. 15, seq. id. no. 16, seq. id. no. 17, seq. id. no. 18, seq 

25 id. no. 19, seq. id. no. 20, seq. id. no. 21, seq. id. no. 22, seq. id. no. 
23, seq. id. no. 24, seq. id. no. 25, seq. id. no. 26, seq. id. no. 27, seq 
id. no. 28, seq. id. no. 29 or a sequence complementary thereto, or a 
fragment thereof, for use as a hybridisation probe for cDNA clones 
encoding tick, more particularly Ixodes ricinus, salivary gland 

30 polypeptides ; or for isolating clones of other genes similar to tick 
salivary gland cDNAs. 

20. A diagnostic kit for a disease or susf^e^t^bility to a 
disease which comprises : 
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(a) a tick salivary gland polynucleotide, preferably the 
nucleotide sequence of one of the gene sequences defined in claim 10, 
or a fragment thereof, 

(b) a nucleotide sequence complementary to that of(a); 
5 (c) a tick salivary gland polypeptide, preferably the 

polypeptide encoded by one of the gene sequences defined in claim 10, 
or a fragment thereof; 

(d) an antibody to a tick salivary gland polypeptide, 
preferably to the polypeptide encoded by one of the gene sequences 

10 defined in claim 10; or 

(e) a phage displaying an antibody to a tick salivary gland 
polypeptide, preferably to the polypeptide encoded by one of the 
cDNAs sequences defined in claim 10 whereby (a), (b), (c), (d) or (e) 
may comprise a substantial component. 
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FIGURE 2. 



